THE PARTIAL REINFORCEMENT EFFECT SUSTAINED
THROUGH BLOCKS OF CONTINUOUS
REINFORCEMENT 1


JOHN THEIOS 2


Stanford University 


The discrimination interpretation 
of partial reinforcement (Bitterman, 
Fedderson, & Tyler, 1953; Mowrer & 
Jones, 1945) holds that resistance to 
extinction is a function of the degree 
of similarity between the training 
reinforcement schedule and the con- 
tinuous nonreinforcement of extinc- 
tion. According to this point of view, 
it is relatively easy for continuously 
reinforced Ss to discriminate when 
extinction starts because of the sharp 
contrast in the reinforcing events at 
the onset of extinction. Partially 
reinforced Ss have difficulty in dis- 
criminating the transition from train- 
ing to extinction because they have 


1 This study is based on a dissertation sub- 
mitted to the faculty of Stanford University 
in partial fulfillment of the requirements for 
the PhD degree. The writer wishes to express 
his gratitude to the chairman of his disserta- 
tion committee, Douglas H. Lawrence, and 
to Gordon H. Bower for their cogent advice 
and generous aid during all stages of the 
development of this study. Thanks are also 
due to Ernest R. Hilgard and Patrick Suppes 
for their valuable comments during the later 
stages. The research was conducted while 
the writer was a National Science Foundation 
Cooperative Graduate Fellow. 

2 Now at the University of Texas.


experienced runs of nonreinforced
trials, and the first few extinction
trials provide a stimulus situation
which is similar to that experienced
during the nonreinforced runs in
training.

In stimulus generalization terms,
continuously reinforced Ss experience
a large stimulus change during extinction
which results in a large decrement
in response strength. Partially
reinforced Ss experience only a small
stimulus change, and their response
strength is decreased only slightly.
The difference in resistance to extinction
is attributable to the differential
stimulus generalization decrements.

A possible test of the discrimination
hypothesis would consist of varying
the number of continuously reinforced
trials interpolated between
an initial partial reinforcement series
and an extinction series. The continuously
reinforced trials would serve
to isolate the partial reinforcement
training from extinction by permitting
the partially reinforced Ss to have a
sharp contrast in the reinforcing
events at the onset of extinction
similar to that experienced by continuously
reinforced Ss. This stimulus
change experienced by Ss that
have received some continuous rein- 
forcement following partial reinforce- 
ment should be larger than the change 
experienced by Ss that received only 
partial reinforcement, and according 
to the discrimination hypothesis, the 
former Ss should extinguish faster 
than the latter Ss. In general, the 
longer the block of continuous rein- 
forcement given after partial rein- 
forcement, the less similarity there 
should be between training and ex- 
tinction. Therefore, resistance to 
extinction is expected to decrease 
with the number of continuously 
reinforced trials interpolated between 
partial reinforcement and extinction. 
As the number of interpolated trials 
becomes large, resistance to extinction 
should approach that of continuously 
reinforced Ss. The present experiment
was designed to test these predictions
from the discrimination theory
of partial reinforcement.


METHOD 


Apparatus.—The apparatus was a black 
wooden runway 5 in. wide, 5 in. high, and 
99 in. long. The first 12 in. comprised the 
start box and the last 15 in. the goal box.
The start and goal boxes were separated from
the alley by metal guillotine doors. A 
1.8 X 2.5 in. food cup was concealed behind 
a 1.6-in. metal shield at the end of the goal 
box. The presence of the shield required S 
to put his head into the food cup to see its 
contents. 

Response time was measured in .01 sec. 
from the opening of the start door until S 
interrupted a photobeam 5 in. inside the goal 
box. On rewarded trials a food pellet was 
automatically delivered by a solenoid feeder 
when S interrupted a second photobeam 
positioned across the food cup. The reward 
consisted of one “Frosty O’s” pellet, a sugar-coated
breakfast cereal produced by General
Mills, Incorporated. On nonrewarded trials 
the solenoid operated, but no food was de- 
livered. The Ss required about 15 sec. to 
consume the pellet, and were removed to the 


home cage 15 sec. after the interruption of 
the first photobeam. 

Subjects.—The Ss were 60 naive Slonaker
albino rats, 28 males and 32 females, selected 
from the colony maintained by the Stanford 
University Psychology Department. They 
were approximately 90 days old at the be- 
ginning of the experiment and had been 
tamed for 3 days prior to the experiment. 
The Ss were housed in individual cages with 
ad lib. water and were maintained on 22 hr. 
food deprivation throughout the experiment. 
The Ss were permitted to eat lab chow for 
2 hr. after each daily experimental session. 

Procedure.—After taming, S was placed 
into the goal box for 5 trials. Reward pellets 
were scattered throughout the goal box on the 
first 2 trials, but only the food cup contained 
pellets on the last 3 trials. When S ate one 
pellet he was removed to his home cage. 
Preliminary runway training followed on the 
next 6 days at 5 trials a day. The 30 pre- 
liminary runway trials were continuously 
rewarded for all Ss. The Ss were run in
rotation, and the intertrial interval decreased
from approximately 60 to 30 min. during
this preliminary training.

After preliminary training, Ss were divided
into five groups, equated on performance
during the preliminary 30 trials. During
the remaining part of the experiment Ss
received 5 runway trials each day. Three
groups of 14 Ss each received 40% partial
reinforcement (P) for 70 trials. The schedule
was random except for the restrictions
that runs of longer than 6 rewarded or
nonrewarded trials were excluded and that the
series ended with a rewarded trial. One of the
P groups was reduced to 13 Ss when a rat
died during the experiment. The remaining
two groups, each consisting of 9 Ss, received
continuous reinforcement (C) for 70 trials.
All Ss were left in the goal box for 15 sec.
on each trial during this phase, irrespective
of whether the trial was rewarded or not.
The intertrial intervals averaged about
10 min.

Groups P-0 and C-0 were extinguished
following the last of the 70 training trials.
Group P-25, however, received 25 additional
100% rewarded trials, and then was extinguished.
Groups P-70 and C-70 were run
for 14 additional days on 100% reward,
receiving 70 additional continuously rewarded
trials before they were extinguished.
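
The restrictions on the 40% schedule stated in the Procedure can be illustrated with a short sketch. This is not the procedure actually used to draw the schedules, which the text does not describe; it assumes exactly 28 rewarded trials out of 70 and uses simple rejection sampling. The function names and the seed are hypothetical.

```python
import random


def longest_run(seq):
    """Length of the longest run of identical consecutive values."""
    longest = current = 1
    for prev, cur in zip(seq, seq[1:]):
        current = current + 1 if cur == prev else 1
        longest = max(longest, current)
    return longest


def make_partial_schedule(n_trials=70, p_reward=0.40, max_run=6, seed=0):
    """Return a list of booleans (True = rewarded trial).

    Assumption: the 40% figure is met exactly (28 of 70 trials).
    Restrictions taken from the text: no run of rewarded or nonrewarded
    trials longer than max_run, and the series ends on a rewarded trial.
    """
    rng = random.Random(seed)
    n_rewarded = round(n_trials * p_reward)
    schedule = [True] * n_rewarded + [False] * (n_trials - n_rewarded)
    while True:
        rng.shuffle(schedule)
        # Reject any shuffle that violates one of the two restrictions.
        if schedule[-1] and longest_run(schedule) <= max_run:
            return list(schedule)


schedule = make_partial_schedule()
```

Rejection sampling is chosen only because it makes the two restrictions explicit; any method that yields an admissible sequence would serve equally well.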


During extinction S received 5 trials each
day until he accumulated 5 trials, not necessarily
consecutive, during which he did not
enter the goal box within 60 sec. In order to
obtain extinction curves based on all Ss,
each S was run for at least 40 trials. If S did 
not enter the goal box within 60 sec., he was 
removed to his home cage and given a score 
of 60 sec. If S entered the goal box, he was 
detained in the box for 15 sec. The empty 
feeder operated if S looked into the food cup. 
The intertrial interval averaged about 10 min. 
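
The extinction criterion just described can be written out as a small scoring routine. This is a sketch under stated assumptions: a trial on which S never entered the goal box is coded as `None`, the trial on which the fifth failure occurs is counted in the score, and the function names are hypothetical.

```python
def trials_to_criterion(times, limit=60.0, n_failures=5):
    """Trial number on which the n-th failure to enter within `limit` occurs.

    `times` holds one response time (sec.) per extinction trial, or None
    when S did not enter the goal box. Failures need not be consecutive.
    Returns None if the criterion is never reached.
    """
    failures = 0
    for trial, t in enumerate(times, start=1):
        if t is None or t >= limit:
            failures += 1
            if failures == n_failures:
                return trial
    return None


def scored_times(times, limit=60.0):
    """Apply the scoring rule: a failure to enter receives a score of 60 sec."""
    return [limit if (t is None or t >= limit) else t for t in times]
```

For example, with failures on Trials 2, 4, 5, 7, and 8 of a hypothetical record, `trials_to_criterion` returns 8.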


RESULTS 


Training.—Curves of response speed 
during training are presented in Fig. 1. 
An S’s median response speed for each 
block of 5 trials was computed. Each 
point on the curves represents a group 
mean of these median scores for a 
given block of trials. The Ss were 
divided into five groups just before 
Block 7. An analysis of variance on 
the median speed scores of Block 7 
yielded an F< 1.0. Hence, the 
division of Ss did result in comparable 
groups. 
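
The two-step summary used for the training curves (a median per S per block of 5 trials, then a group mean of those medians) can be sketched as follows. The input format and function names are assumptions made for illustration; the scores may be speeds or times.

```python
from statistics import mean, median


def block_medians(scores, block_size=5):
    """Median score of one S for each successive block of trials."""
    return [median(scores[i:i + block_size])
            for i in range(0, len(scores), block_size)]


def group_curve(subject_scores, block_size=5):
    """Group mean of the individual block medians, one value per block."""
    per_subject = [block_medians(s, block_size) for s in subject_scores]
    return [mean(block) for block in zip(*per_subject)]
```

Taking the median within each S before averaging keeps a single very slow trial from dominating that S's contribution to the group curve.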

The curves show that between
Blocks 7 and 20, the trials on which
the P Ss received 40% reinforcement,
the continuously reinforced Ss eventually
ran faster than the partially
reinforced Ss. The means and SDs of
Ss’ median speeds during Block 20
are presented in Table 1. An analysis
of variance performed on these scores
indicated that there was no significant
difference between the two continuously
reinforced groups (F = .28,
df = 1/54), nor among the three
partial groups (F = 1.14, df = 2/54).
However, the partial groups ran
significantly slower than the 100%
groups (F = 7.16, df = 1/54, P < .01).

[Fig. 1. Response speeds during training.]


TABLE 1


MEANS AND SDs OF SPEED SCORES ON
BLOCK 20


Group
P-0
P-25
P-70
C-0
C-70


After Block 20, when Groups P-25 
and P-70 were changed from 40% 
reinforcement to 100%, their response 
speeds increased essentially to that 
of the C-70 Ss. Since Group P-25 
was extinguished following Block 25, 
a comparison of the P groups and 
Group C-70 was made by an analysis 
of variance on the Block 25 median 
speed scores. There was no signifi- 
cant difference between the two P 
groups (F = 1.13, df = 1/34) nor
between the combined P groups and 
Group C-70 (F = 1.00, df = 1/34). 
Thus, while on 40% reinforcement, 
the P Ss ran significantly slower than 
the 100% Ss. When the P Ss were 
subsequently given 100% reinforce- 
ment, however, their response speeds 
significantly increased so that the 
initial difference between the 40% 
and 100% Ss disappeared in less than
25 trials. 

Extinction.—Curves of response 
time during extinction are given in 
Fig. 2. Each point on the curves 
represents a group mean of individual 
median time scores for a given block 
of 5 extinction trials. The striking 
feature of Fig. 2 is the difference be- 
tween the extinction curves of the P 
and C groups. The three P groups, all 
of which initially received 40% reinforcement,
have very similar extinction
curves in spite of the fact that one
group had 0, the second group 25, 
and the third 70 trials of 100% rein- 
forcement between partial reinforce- 
ment and extinction. The curves of 
the two C groups are also very similar 
to each other. The difference be- 
tween the P and C groups is evident 
as early as the second block of 5 
extinction trials and indicates much 
faster extinction for the C groups. 
Thus, additional trials of 100% reinforcement
following 40% reinforcement had little
effect on rate of extinction.

[Fig. 2. Response times during extinction. Abscissa: daily blocks of 5
extinction trials; ordinate: response time (sec.).]


TABLE 2


MEANS AND SDs OF RESPONSES TO
EXTINCTION CRITERION


The large difference in extinction
rate between the P and C groups
evidenced by the curves is also reflected
in the mean number of trials
to reach the extinction criterion
(Table 2). All three P groups took
at least 50 trials to reach the criterion,
but the C groups took only about 32
trials. It can be seen that the SDs
for the C groups are much smaller
than those for the P groups, making
inadvisable the use of an overall
analysis of variance on the responses
to the criterion scores. For Group
C-0 and Group C-70, a t of .78 was
obtained, indicating that, for 100%
reinforced Ss, a difference between
100 and 170 training trials had no
effect on resistance to extinction.
Homogeneity of variance among the
responses to the criterion for the three
P groups permitted a 1 X 3 analysis
of variance. The obtained F of 6.26
(df = 2/38) is significant at the .01
level, and indicates that individual
t tests are appropriate. Using the
overall within-groups variance as the
variance estimate for the error term
of the t tests, Groups P-0 and P-25 are
not significantly different from each
other (t = .88). The t between
Groups P-0 and P-70 of 2.46, however,
is significant (df = 25, P < .05). Similarly,
the t between Groups P-25 and
P-70 of 3.40 is significant (df = 26,
P < .01).
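
The degrees of freedom reported in the Results can be checked against the group sizes given in the Method. The sketch below is bookkeeping only. It assumes that Group P-0 was the P group reduced to 13 Ss; the text does not say which group lost a rat, but that assignment is the one that reproduces every reported df. The dictionary and function names are hypothetical.

```python
# Group sizes from the Method; assigning n = 13 to P-0 is an inference.
SIZES = {"P-0": 13, "P-25": 14, "P-70": 14, "C-0": 9, "C-70": 9}


def df_within(groups):
    """Within-groups df for a one-way analysis: total N minus number of groups."""
    return sum(SIZES[g] for g in groups) - len(groups)


def df_two_group_t(group_a, group_b):
    """Degrees of freedom of a two-group t test: n1 + n2 - 2."""
    return SIZES[group_a] + SIZES[group_b] - 2


block_20_df = df_within(["P-0", "P-25", "P-70", "C-0", "C-70"])   # reported 54
block_25_df = df_within(["P-25", "P-70", "C-70"])                 # reported 34
p_groups_df = df_within(["P-0", "P-25", "P-70"])                  # reported 38
t_df_p0_p70 = df_two_group_t("P-0", "P-70")                       # reported 25
t_df_p25_p70 = df_two_group_t("P-25", "P-70")                     # reported 26
```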

In order to demonstrate that all 
the P groups are significantly more 
resistant to extinction than the C 
groups, a median test was run between 
Group P-70, the least resistant of the 
P groups, and the combination of the 
two C groups. The obtained χ² of
24.89 is highly significant (P < .001, 
df =1). There is almost no overlap 
in the two distributions. This large 
difference was obtained in spite of the 
fact that Group P-70 had 70 con- 
tinuously rewarded trials just prior 
to extinction. 
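
For readers unfamiliar with the median test, the following sketch shows how such a χ² is obtained from two sets of scores. It is not a reanalysis: the individual scores are not given in the text, so the example data are hypothetical, and the sketch assumes the uncorrected Pearson χ² (the text does not say whether a continuity correction was applied). Scores equal to the grand median are counted as "not above".

```python
from statistics import median


def median_test_chi2(a, b):
    """Median test for two independent samples.

    Builds a 2 x 2 table (sample by above / not above the grand median)
    and returns the uncorrected Pearson chi-square (df = 1) with the table.
    """
    grand = median(list(a) + list(b))
    a_above = sum(x > grand for x in a)
    b_above = sum(x > grand for x in b)
    table = [[a_above, len(a) - a_above],
             [b_above, len(b) - b_above]]
    n = len(a) + len(b)
    rows = [sum(r) for r in table]
    cols = [table[0][0] + table[1][0], table[0][1] + table[1][1]]
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = rows[i] * cols[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected
    return chi2, table


# Hypothetical scores with no overlap between the two samples.
chi2, table = median_test_chi2([50, 55, 60, 65], [30, 31, 32, 33])
```

With completely separated samples, as in this toy case, every S in one sample falls above the grand median and every S in the other falls below it, which is the pattern the text describes as "almost no overlap".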


DISCUSSION 


The present results indicate that the
partial reinforcement effect (PRE) of
increased resistance to extinction is relatively
unaffected by continuous reinforcement
following the partial reinforcement.
After the present experiment was
completed, a study by Quartermain and 
Vaughan (1961) was published which 
bears on the same issue. Using lever 
pressing in a Skinner box, they com- 
pared resistance to extinction of rats 
given acquisition training with either 
800 responses of which a random 10% 
were reinforced, or 400 responses at 10% 
reinforcement followed by 400 continu- 
ously reinforced responses. Their results 
were entirely consistent with those re- 
ported here. These findings demonstrate 
that discriminability of the transition 
from training to extinction cannot be 
taken as explaining much of the partial 
reinforcement effect. 

Although it could be assumed that
discrimination has some small effect since
in the present study Group P-70 extinguished
slightly faster than Group
P-0, the fact that there was no difference
in resistance to extinction between
Groups P-0 and P-25 presents a
problem for the discrimination hypothesis.
It might be argued that 25
trials were not sufficient for Ss to have
noticed a change in the reinforcement
schedule and, hence, a difference between
Groups P-0 and P-25 should not
have been expected. However, the
schedule change clearly influenced running
performance. The response speeds
of Group P-25 and P-70 Ss increased
markedly during the 25 continuously
reinforced trials, eventually equaling the
speeds of the Group C-70 Ss. This
increase in performance can be taken
as evidence that the Ss “noticed” the
change.

Another proposal might be that Group
P-25’s resistance to extinction could
have been increased by the fact that it
had 25 more trials than Group P-0.
However, Group C-70 had 70 more trials
than did Group C-0, and these C groups
did not differ in resistance to extinction.
Thus, with the degree of training given
in the present experiment, additional
trials had no effect on resistance to
extinction.


The present experiment can be added
to a growing amount of empirical data
indicating that the discrimination hy- 
pothesis cannot adequately account for 
the PRE. Marx (1958) has shown that 
if certain significant features of the goal 
situation (i.e., the food cup) are omitted 
on half of the extinction trials, rate of 
extinction is decreased rather than 
increased as would be predicted from a 
discrimination analysis. The discrimina- 
tion theory has been most successful in 
accounting for the fact that rate of ex- 
tinction usually varies directly with the 
percentage of reward when number of 
trials or number of reinforcements are 
held constant. However, using a design 
which pitted percentage of reinforcement 
against number of nonreinforcements,
Lawrence, Festinger, and Theios (sum- 
marized by Festinger, 1961) found that 
resistance to extinction was independent 
of the reinforcement percentage, but was 
determined primarily by the number 
of nonreinforced trials Ss experienced 
during training. 

The present data as well as these other 
studies suggest that the discrimination 
hypothesis is inadequate as an interpre- 
tation of partial reinforcement. The 
present data indicate that an adequate 
theory of partial reinforcement must 
have constructs representing relatively 
permanent effects of nonreinforcement 
which can be sustained through blocks 
of continuous reinforcement. At least 
four existing theories of partial rein- 
forcement employ constructs of this 
character. These theories include Amsel 
(1958), Estes (1959), Logan (1960), and 
Festinger (1961). The present results
do not enable one to discriminate among
these theories.


SUMMARY 


Three groups of 14 rats each received 70 
trials of random 40% reinforcement in a run- 
way. Then different numbers of continuously 
reinforced trials were interpolated between 
the 40% reinforcement and extinction. Group 
P-0 received none, Group P-25 received 25, 
and Group P-70 received 70 interpolated 
100% reinforced trials. Two control groups 
(N = 9 each) received only continuous reinforcement
during training: Group C-0 was
extinguished with partial Group P-0; Group 
C-70 was given 70 additional reinforced trials 
and extinguished with Group P-70. 

There was no significant difference in 
resistance to extinction between the two 
continuously reinforced groups. Group P-70 
was very significantly (P < .001) more resistant
to extinction than the two continuously
reinforced groups. There was no significant 
difference in resistance to extinction between 
Groups P-0 and P-25, but both of these groups 
were slightly more resistant than Group P-70. 
These results question the adequacy of the 
discrimination hypothesis that the partial 
reinforcement extinction effect results from 
difficulty in discriminating the transition 
from training to extinction. 


THE EFFECTS OF LANGUAGE HABITS ON THE
ACQUISITION AND RETENTION OF
VERBAL ASSOCIATIONS 1


LEO POSTMAN 


University of California 


The experiments reported in this 
paper investigate the effects of unit- 
sequence habits on the acquisition 
and retention of verbal materials. 
By unit-sequence habits we refer to 
associative connections between ver- 
bal items established through linguis- 
tic usage (Underwood & Postman, 
1960). The transfer effects of such 
language habits may be either positive 
or negative, depending on the rela- 
tionship between S’s pre-experimental 
associations and those prescribed in 
the learning task. 


The amount of transfer from pre- 
experimental habits should increase with 
the frequency of usage of the verbal units 
in the language. The higher the fre- 
quency of usage, the larger on the 
average is the number of different con- 
texts in which an item is likely to appear 
and hence the larger the number of 
different associations which it acquires. 
Conventional indices of meaningfulness
such as association value and m value
do in fact correlate highly with frequency
of usage (Postman, 1961; Underwood &
Schulz, 1960).


Both unit-sequence facilitation and
unit-sequence interference
may then be expected to increase as a
function of frequency of usage. When
the sequence to be learned involves
high-frequency items, at least some
of the prescribed associations are likely
to agree with pre-experimental habits
or can be readily established through
direct mediational links. However, the
number of pre-experimental associations
which can compete with the prescribed
ones will also increase as a function
of frequency of usage. With low-frequency
materials, the prescribed sequences will
have little pre-experimental strength and
few direct mediators will be available, but
there will also be few pre-experimental
associations to produce interference.

The conditions of facilitation and
interference should influence recall as
well as acquisition. When there is
unit-sequence facilitation, it is assumed that
the strength of the pre-experimental
habits has generalized to the prescribed
associations within the list. The greater
the generalized habit strength, the more
resistant the items in the experimental
list will be to interference. When there
is unit-sequence interference, it is assumed
that competing pre-experimental
habits have been unlearned during
acquisition and have recovered over time
to become sources of interference at
recall.

1 This research was supported by a grant
from the National Science Foundation.



An earlier study (Postman, 1961)
investigated the acquisition and retention
of serial lists of words of high and low
frequency of usage (Lists HF and LF).
List HF was learned faster than List LF.
At the same time, however, the rate
of misplaced responses was substantially
greater for List HF. The positive relationship
between speed of learning and
error rate points to covariation of
unit-sequence facilitation and interference
as a function of word frequency. There
were no significant differences in the
amount of retention for the two kinds
of materials. However, misplaced responses
at recall recovered at a faster
rate for List HF than for List LF.
While there was no evidence for superior
retention of materials of high meaningfulness,
the original expectation of an
inverse relationship between word frequency
and recall was not borne out.
This expectation had been based on the
assumption that competition from
extra-experimental associations would increase
directly with word frequency. The difference
in recovery of errors is in accord
with this assumption. However, the
fact that the amounts recalled were
comparable in spite of a substantial
difference in the incidence of overt
errors suggests that increases in
unit-sequence interference may be offset in
part by unit-sequence facilitation.

The present experiments extend the
analysis of the conditions of unit-sequence
facilitation and interference. A
first step was to vary the word frequency
of stimuli (Ss) and responses (Rs) in a
study investigating the acquisition and
retention of paired associates (Exp. I).
There is considerable evidence that speed
of learning varies directly with the
meaningfulness of Ss and Rs, and that
this relationship is considerably more
pronounced and consistent for Rs than
for Ss (Cieutat, Stockwell, & Noble,
1958; Hunt, 1959; Kimble & Dufort,
1955; L’Abate, 1959; Mandler & Campbell,
1957; Morikawa, 1959; Underwood
& Schulz, 1960). In these experiments
the units scaled for meaningfulness (e.g.,
trigrams or paralogs) were either nonsense
items or, more usually, included
both nonsense items and words.
Letter-sequence habits as well as unit-sequence
habits contribute to the differences in
speed of learning obtained with such materials
(Underwood & Schulz, 1960).
The lists used in Exp. I consisted entirely
of words so that variations in speed of
learning and amount of retention could be
attributed primarily to the influence of
unit-sequence habits. It was expected
that both unit-sequence facilitation and
unit-sequence interference would increase
with word frequency, with the relative
amounts of interference growing more
rapidly as a function of S frequency than
of R frequency. To the extent that
paired items are practiced in the order
of their presentation, competing associations
elicited by the Ss should be
more effective in delaying acquisition
than those evoked by the Rs. These
considerations suggest that speed of
learning should rise to a maximum
and then decline as the word frequency
of Ss is increased. On the other hand,
speed of learning should vary directly
with the word frequency of Rs.

No prior studies of the effects of S
and R meaningfulness on the long-term
retention of paired associates are available.
If the associative context of the
Ss rather than the Rs is critical in determining
the amount of effective competition
at recall, differences in the amount
of retention should be determined largely
by the degrees of unit-sequence facilitation
and interference falling on the S
terms. Thus, the differences among S
conditions in learning should predict
recall better than those among the R
conditions.

According to the present analysis, a
sample of high-frequency Ss will activate
strong pre-experimental habits whereas a
sample of low-frequency Ss will not. It
follows that manipulation of the conditions
of facilitation and interference
within a list should be more effective
with low-frequency Ss than with
high-frequency Ss. When pre-experimental
associative probabilities are low, the
introduction of otherwise unlikely sources
of facilitation and interference based
on language habits should produce major
effects on learning and retention. On
the other hand, when the associative
probabilities are high, the introduction
of known associates or competitors
should add relatively little to the
amounts of facilitation and interference
from uncontrolled sources. This prediction
was tested in Exp. II in which
the pre-experimental probability of the
prescribed associations was varied for
Ss of high and low word frequency.

Experiment III was designed to provide
a further test of the assumption
that the higher the word frequency of a
series of items, the greater is the amount
of overlap among their associative contexts.
The method of verbal discrimination
learning was used. To the extent
that discriminability depends on the
availability of differential responses to
the members of a pair, speed of discrimination
learning should increase with the
meaningfulness of the paired items.
Such a relationship was found with pairs
of nonsense syllables by Runquist and
Freeman (1960). However, as the
associates elicited by the items increase
in number, they should begin to lose
distinctiveness because of the increasing
probability of overlap in associative
context. Acquisition will then require
the unlearning of differential responses
which produce errors of generalization,
but such responses may be expected
to recover with the passage of time.
Thus, speed of acquisition and amount
of retention for the verbal discrimination
task should first increase and then
decrease as a function of word frequency.


TABLE 1


STIMULUS AND RESPONSE TERMS IN EXP. I


BUILDING    ANSWER      ARBOR
COUNTRY     COLOR       BURLAP
DOCTOR      DINNER      CINDER
GARDEN      FIGURE      DISCORD
HUSBAND     MOMENT      GROCER
LETTER      PROBLEM     LOTION
MORNING     REASON      MAGNATE
PAPER       SHOULDER    OATMEAL
STORY       TABLE       TRAITOR
WINDOW      WOMAN       WHISKER

EXPERIMENT I


Method


Materials.—The learning materials were
two-syllable nouns of high, medium, and low
frequency of usage. Each frequency range
was represented by two lists of 10 words
each, one of which served as the S list and the
other as the R list (Sh, Sm, Sl; Rh, Rm, Rl).
These materials were used in the construction
of nine paired-associate lists representing all
the possible combinations of word frequencies
of the S and R items. The lists of words are
shown in Table 1.


The lists sample three frequency ranges in
the “L” count of Thorndike and Lorge (1944).
The numbers of occurrences in 4.5 million
are 1000-3300, 10-33, and 1-3 for words of
high, medium, and low frequency, respectively.
The mean rated familiarity and m
values, which were obtained in prior standardization
procedures, are correlated with frequency
of usage. The words were chosen
so as to minimize pre-experimental associations
between items in the S lists and items
in the R lists. Norms of free association
obtained from 1000 students at the University
of California were used for this purpose.


TABLE 1 (continued)


BRAMBLE     BASIN       ABBESS
CAUCUS      DOGMA       BUFFOON
DECOY       FETISH      DOTAGE
FARTHING    HERMIT      HAREM
GULLET      MINSTREL    OBOE
LORRY       OMEN        PREFIX
OXIDE       RELIC       RAMROD
PESTLE      SUFFRAGE    STANZA
SEQUEL      TEMPEST     TENURE
WICKET      WAFER       WAMPUM


There was no duplication of first letters 
in any of the S or R lists. For all nine combi- 
nations of word frequencies the S list and the 
R list had six first letters in common. Four 
different pairings of Ss and Rs were used 
equally often with each of the nine lists.
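
The two first-letter constraints can be checked mechanically against the words of Table 1. The sketch below rests on two assumptions that the surviving table does not state: four damaged entries are read as SUFFRAGE, DOTAGE, OBOE, and TENURE, and the six lists are split into three S lists and three R lists in the one way that gives six shared first letters for all nine combinations. The labels S1-S3 and R1-R3 are placeholders and do not assert which list belongs to which frequency range.

```python
from itertools import product

# Assumed split into S and R lists (an inference, see the lead-in).
S_LISTS = {
    "S1": "BUILDING COUNTRY DOCTOR GARDEN HUSBAND LETTER MORNING PAPER STORY WINDOW",
    "S2": "ARBOR BURLAP CINDER DISCORD GROCER LOTION MAGNATE OATMEAL TRAITOR WHISKER",
    "S3": "BRAMBLE CAUCUS DECOY FARTHING GULLET LORRY OXIDE PESTLE SEQUEL WICKET",
}
R_LISTS = {
    "R1": "ANSWER COLOR DINNER FIGURE MOMENT PROBLEM REASON SHOULDER TABLE WOMAN",
    "R2": "BASIN DOGMA FETISH HERMIT MINSTREL OMEN RELIC SUFFRAGE TEMPEST WAFER",
    "R3": "ABBESS BUFFOON DOTAGE HAREM OBOE PREFIX RAMROD STANZA TENURE WAMPUM",
}


def first_letters(words):
    """Set of initial letters of a space-separated word list."""
    return {w[0] for w in words.split()}


def no_duplicate_first_letters(words):
    """True when every word in the list starts with a different letter."""
    return len(first_letters(words)) == len(words.split())


def shared_first_letters(s_words, r_words):
    """Number of first letters an S list and an R list have in common."""
    return len(first_letters(s_words) & first_letters(r_words))


overlaps = {
    (s, r): shared_first_letters(S_LISTS[s], R_LISTS[r])
    for s, r in product(S_LISTS, R_LISTS)
}
```

Under these assumptions every list has ten distinct first letters and each of the nine S-R combinations shares exactly six of them, in agreement with the description in the text.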

Procedure.—The lists were presented on a
Hull-type memory drum at a 2:2 rate (S
alone for 2 sec. and S and R together for 2
sec.), with an 8-sec. intertrial interval.
Learning was to a criterion of one perfect
trial. Four different orders of presentation
were used to minimize serial learning. Each
of the four random orders was used as a starting
order equally often.

Retention was tested by relearning either
30 sec. or 7 days after the end of original
practice. Relearning was for five trials or to
criterion, whichever took the longer.

Subjects.—With nine types of lists and two
retention intervals, there were 18 groups of
16 Ss each. All Ss were undergraduate
students who had English as their native
language and were naive to rote-learning
experiments. The Ss were assigned to conditions
in blocks of 18, with 1 S per block for
each combination of lists and retention intervals.
The running order within each block
was determined by a table of random numbers,
as was the assignment to different
pairings and starting orders. No Ss were
lost because of failure to learn.
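
The blocked assignment described under Subjects can be sketched in a few lines. The original study used a table of random numbers; the pseudo-random generator, the seed, and the condition labels below are stand-ins chosen for illustration.

```python
import random
from itertools import product


def blocked_assignment(n_blocks=16, seed=0):
    """Running order for 18 conditions x n_blocks Ss.

    Each block of 18 successive Ss contains every combination of S
    frequency, R frequency, and retention interval exactly once, in an
    order shuffled independently for each block.
    """
    rng = random.Random(seed)
    conditions = list(product(["Sh", "Sm", "Sl"],
                              ["Rh", "Rm", "Rl"],
                              ["30 sec.", "7 days"]))
    order = []
    for _ in range(n_blocks):
        block = conditions[:]
        rng.shuffle(block)
        order.extend(block)
    return order


running_order = blocked_assignment()
```

Blocking in this way keeps the groups balanced at every stage of data collection, so that any drift in the S population over the course of the experiment is spread evenly over conditions.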


Results 


TABLE 2


MEAN NUMBERS OF TRIALS TO CRITERION:
EXP. I


[Table layout not recoverable. Surviving entries (Mean | SD):
10.5 | 6.3; 12.9 | 6.0; 16.2 | 8.5; over-all mean 13.2.]


Original learning.—There were no
significant differences between the
30-sec. and 7-day groups learning a
given type of list. The results of the
two groups learning the same materials
have, therefore, been combined.
Table 2 shows the mean trials to
criterion for each of the nine kinds of
lists and also summarizes the average
trends produced by the word frequency
of the Ss and Rs. Speed of
learning first increases and then decreases
as a function of the word
frequency of Ss but varies directly
with the word frequency of Rs.
The over-all effects of the word frequency
of both Ss and Rs are significant
beyond the .01 level (F = 6.15
for Ss and 7.24 for Rs, df = 2/279
for both). The S X R interaction is
not significant (F = 1.54, df = 4/279).
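
The degrees of freedom quoted for the 3 X 3 analysis follow from the design once the two retention-interval groups are combined. The arithmetic is sketched below; the variable names are hypothetical and nothing beyond the figures stated in the Method is assumed.

```python
# Design figures taken from the Method section.
LEVELS_S = 3          # word frequency of S terms: high, medium, low
LEVELS_R = 3          # word frequency of R terms: high, medium, low
N_INTERVALS = 2       # 30 sec. and 7 days
N_PER_GROUP = 16

n_groups = LEVELS_S * LEVELS_R * N_INTERVALS      # 18 groups
n_total = n_groups * N_PER_GROUP                  # 288 Ss

# With retention intervals combined, the analysis has 9 cells of 32 Ss.
n_cells = LEVELS_S * LEVELS_R
df_s = LEVELS_S - 1                               # 2
df_r = LEVELS_R - 1                               # 2
df_interaction = df_s * df_r                      # 4
df_error = n_total - n_cells                      # 279
```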

Errors during learning.—Table 3 
shows the mean percentages of mis- 
placed Rs during learning. Such 
responses account for the great major- 
ity of overt errors. The percentages 
are based on opportunities (total 
number of presentations minus num- 
ber of correct responses) and thus 
are independent of speed of learning. 

For both Ss and Rs the mean per- 
centages of errors are clearly larger 
when word frequency is high than 
when it is medium or low. There 
are only small differences between 
the latter two conditions. Following 
arc-sine transformation and with 2/279 
df, the F ratio for Ss is 12.44 (P <.001) 
and 3.11 for Rs (.02 < P < .05). 
There is also a significant S X R
interaction (F = 2.62, df = 4/279, 
.02 < P < .05). When the word 
frequency of either the Ss or the Rs 
is high, the percentage of errors re- 
mains uniformly large, regardless of 
the value of the other variable. 
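
The error measure and its transformation can be made concrete with a short sketch. The percentage follows the definition in the text (misplaced Rs divided by opportunities, where opportunities are presentations minus correct responses). The particular form of the arc-sine transformation, the arcsine of the square root of the proportion in radians, is an assumption, since the text does not specify which variant was used. Function names and the example counts are hypothetical.

```python
import math


def misplaced_percentage(n_misplaced, n_presentations, n_correct):
    """Misplaced Rs as a percentage of opportunities for error."""
    opportunities = n_presentations - n_correct
    if opportunities <= 0:
        raise ValueError("no opportunities for error")
    return 100.0 * n_misplaced / opportunities


def arcsine_transform(percentage):
    """Assumed variant: arcsin(sqrt(p)) in radians, with p a proportion."""
    return math.asin(math.sqrt(percentage / 100.0))


# Hypothetical S: 100 presentations, 40 correct, 15 misplaced Rs.
pct = misplaced_percentage(15, 100, 40)
```

Because the denominator excludes correct responses, a fast learner and a slow learner with the same tendency to give misplaced Rs receive the same percentage, which is the sense in which the measure is independent of speed of learning.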

Table 3 also shows for each type of 
list the product-moment correlation 
between trials to criterion and per- 
centage of misplaced Rs during learn- 


TABLE 3 


MEAN PERCENTAGES OF MISPLACED RESPONSES AND CORRELATION BETWEEN 
PERCENTAGES AND TRIALS TO CRITERION: Exp. I 


Stimulus Terms 





R Terms 





17.5 
12.9 
15.3 





Over-all 
Mean | 








10.1 
12.1 

















SP < 0. 
TABLE 4


MEAN NUMBERS OF ITEMS RECALLED IN Exp. I


30-Sec. Recall 


Stimulus Terms 


Mean SD 
Rp 
Res 
R, 
Over-all | 
Mean | 9.1 


| 


ing. All the correlations are positive, 
i.e., the higher the error rate, the more 
slowly is criterion reached. This 
relationship is especially pronounced 
and statistically significant for Cond. 
Sin and Ry. 

The inverse relationship between 
error rate and speed of learning found 
for individuals within groups clearly 
does not hold for materials. On the 
R side, both speed of learning and 
error rate are maximal for Cond. R,; 
on the S side, error rate is maximal, 
and speed of learning is intermediate, 
for Cond. S,. Within a given group, 
however, rate of acquisition varies 
with Ss’ susceptibility to interference, 
especially when learning is fast as 
in Cond. S,, and Ry. When correct 
Rs are acquired rapidly, a low per- 
centage of misplaced Rs is favorable 
to attainment of the criterion, i.e., 
failures to respond are eliminated 
more readily than overt errors. When 
correct Rs increase relatively slowly, 
the criterion score is less sensitive
to the type of error.

Recall.—Table 4 shows the mean
numbers of Rs recalled by the various
groups. The drops on the 30-sec.
test vary inversely with speed of
learning although the correlation is
not perfect. The amounts of reten-
tion loss (mean 30-sec. recall - mean
7-day recall) as a function of the word
frequency of Ss and Rs are shown in Fig.
1. The losses during the 7-day inter-
val are smallest for S,, and greatest
for S,, with S, yielding an inter-
mediate amount of forgetting. The


[Figure 1: retention loss (30 sec. - 7 days) plotted against log frequency; panel label: RESPONSES]

Fig. 1. Amounts of retention loss (mean
30-sec. recall - mean 7-day recall) as a func-
tion of the word frequency of Ss and Rs
(Exp. I).
TABLE 5


NUMBERS OF MISPLACED RESPONSES AT RECALL: 


Recall 


Over-All _ is 


a l 
| Total| N | Total V 


Over all 


[ 
E 
iE 


Note NV = Number of Ss giving 


average difference between S, and
Sm is, however, clearly smaller than
that between S, and S,,. The trends
for the three values of R are com-
parable. There are no consistent
variations in retention loss as a func-
tion of the word frequency of Rs.
The slight over-all advantage of
R, is due entirely to the variations
under Cond. S; and is not present
at the other two values of S. Follow-
ing a Freeman-Tukey square root
transformation, the retention scores
were subjected to an analysis of
variance. The significance of the
differences in the amount of forgetting
is assessed by the interactions of time
(T) with the values of S and R. Only
the interaction, S X T, is significant
(F = 3.08, df = 2/270, .02 < P < .05).
The F ratios for R X T and S X R X T
are .88 and 1.08, respectively. Thus,
the rank order of the conditions 
is the same for speed of original learn- 
ing and amount of long-term reten- 
tion. On the other hand, the differ- 
ences in rate of acquisition among the 
R conditions are not reflected con- 
sistently in the amounts of retention. 
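
The Freeman-Tukey square root transformation mentioned above is usually written as sqrt(x) + sqrt(x + 1) for a count x. The paper does not spell out the formula, so the sketch below assumes that standard form; the function name is hypothetical.

```python
import math


def freeman_tukey_sqrt(count):
    """Freeman-Tukey square-root transform of a non-negative count.

    Assumes the standard form sqrt(x) + sqrt(x + 1); the paper itself
    only names the transformation.
    """
    if count < 0:
        raise ValueError("count must be non-negative")
    return math.sqrt(count) + math.sqrt(count + 1)


# Applied to recall scores (numbers of items recalled) before the
# analysis of variance; a score of zero stays finite and well behaved.
zero_score = freeman_tukey_sqrt(0)
```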

Misplaced responses at recall.—The 
numbers of misplaced Rs at recall 
and the numbers of Ss contributing 
them are shown in Table 5. Such 


Exp. | 
7-Day Recall 


Stimulus Terms 


Over-All 


17 
9 


misplaced responses 


errors account for only a small frac- 
tion of the total retention losses. 
A large proportion of Ss (99/144 in
the 30-sec. groups and 88/144 in the
7-day groups) failed to give mis-
placed Rs. The observed frequencies
indicate a higher rate of recovery 
of intralist intrusions for S, and S, 
than for S,, and also a higher rate 
for Ry than for R,, and R;. In view 
of the high proportion of zero scores, 
the differences in temporal trends 
were not evaluated statistically and 
must be interpreted with caution. 
It is interesting to note, however, 
that the apparent differences in rate 
of recovery are related to the amount 
of forgetting for the S conditions but 
not the R conditions. The low rate 
of intralist intrusions suggests that 
Ss had considerable success in rejecting 
incorrect Rs. 
Relearning.—The mean numbers
of trials to criterion in relearning are
shown in Table 6. The relative
amounts of retention loss correspond
to those found at recall: the increase
in trials to criterion is greater for S;
and S, than for S,, whereas the word
frequency of Rs has little effect.
The interaction, S X T, is significant
(F = 4.52, df = 2/270, .02 < P < .05).
TABLE 6


MEAN NUMBERS OF TRIALS IN RELEARNING TO CRITERION: Exp. I


30-Sec. Recall 


Stimulus Terms 


Over-all 
Mean 


The F ratios for R X T and S X R X T
are .01 and .85, respectively. 


EXPERIMENT II 
Method


Materials.—There were two lists of 10
paired associates, one with S terms of high
word frequency (List S_h), and the other with
S terms of low word frequency (List S_l). The
S terms in both lists were two-syllable nouns
drawn from the same pools of items as the
words in Exp. I. Half the Rs in each list
were highly probable associates (A_h) of the
S words, i.e., they were the primary Rs to
these words in free association norms ob-
tained from 1000 students at the University
of California. The other half of the Rs were
associates of low probability (A_l), having
been given by only 2/1000 Ss in the norma-
tive group. Thus, there were four types of
pairs: S_h-A_h and S_h-A_l in one list, S_l-A_h and
S_l-A_l in the other. All the Rs were nouns of
high frequency of usage falling within the
AA and A ranges of the General Count of
Thorndike and Lorge (1944). Within each
of the four classes of pairs three of the Rs
were two-syllable words and two Rs were
one-syllable words.

There was no duplication of first letters
in either of the S lists. One first letter was
duplicated in each of the two R lists.
In both


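TABLE 7


STIMULUS-RESPONSE PAIRS USED IN Exp. II


List S_h (S terms of high word frequency)

  High-probability pairs                    Low-probability pairs
  Stimulus   Response   Assoc. Prob.*       Stimulus    Response   Assoc. Prob.*
  GARDEN     FLOWER     (illegible)         BUILDING    SIZE       2
  MOMENT     TIME       (illegible)         COUNTRY     VALLEY     2
  ORDER      COMMAND    (illegible)         LETTER      PERSON     2
  TROUBLE    PROBLEM    (illegible)         REASON      NEED       2
  WINDOW     GLASS      (illegible)         SHOULDER    BURDEN     2


List S_l (S terms of low word frequency)

  High-probability pairs                    Low-probability pairs
  Stimulus   Response   Assoc. Prob.*       Stimulus    Response   Assoc. Prob.*
  CAUCUS     MEETING    156                 BRAMBLE     PAIN       2
  GULLET     THROAT     180                 DECOY       ARMY       2
  HAREM      WOMAN      340                 OXIDE       COLOR      2
  RAMROD     GUN        258                 TENURE      LIFE       2
  STANZA     MUSIC      282                 WAMPUM      DOLLAR     2


* Number of Ss out of 1000 giving response on test of free association.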
TABLE 8


MEAN NUMBERS OF TRIALS TO CRITERION OF 5/5 FOR EACH CLASS OF PAIRS AND
10/10 FOR TOTAL LISTS IN ORIGINAL LEARNING AND RELEARNING: Exp. II


7-Day Groups 
Stimulus List 


SD Mean 


Original Learning 


An 
Ai 
Total list 


9 
1.0 
1.0 


Relearning 


An 


Ai 
Total list 


lists the S and R terms had six first letters
in common. Within a list none of the Rs
appeared in the association tables for any
of the stimuli except the one with which
it was paired. Table 7 presents the lists
and for each pair shows the frequency of the
R term in the association norms.

Procedure.—The lists were presented on a
Hull-type memory drum at a 2:2 rate with an
8-sec. intertrial interval. Learning was to
one perfect trial. There were four different
orders of presentation each of which was
used as a starting order equally often. Re-
tention was tested by relearning either 30 sec.
or 7 days after the end of original learning.
Relearning was for five trials or to criterion,
whichever took the longer.

Subjects.—With two lists and two reten-
tion intervals, the design comprised four
groups of 16 Ss each. All Ss were under-
graduate students who had English as their
native language and were naive to rote-
learning experiments. The Ss were assigned
to conditions in blocks of 4, with 1 S per
block for each combination of lists and reten-
tion intervals. Assignment to conditions
within a block was determined by a table
of random numbers as was the assignment to
different starting orders. No Ss were lost
because of failure to learn.


Results


Original learning.—The mean num-
bers of trials to a criterion of 5/5 for
each type of pair as well as to a
criterion of 10/10 for the entire list
are shown in Table 8. There were
no significant differences between the
30-sec. and 7-day groups learning a
given kind of list. Pairs S_l-A_h were
mastered extremely rapidly, with the
large majority of Ss reaching the
criterion of 5/5 in either one or two
trials. Pairs S_l-A_l were learned more
slowly than Pairs S_l-A_h. However,
with half the pairs learned almost
immediately, acquisition of the entire
list was very rapid. By contrast,
Pairs S_h-A_h were not learned faster
than Pairs S_h-A_l; in fact, there was a
trend in the opposite direction. The
over-all rate of learning was signifi-
cantly slower for List S_h than for
List S_l (t = 4.97, df = 62, P < .01).
Analysis of variance of the criterion
scores on the four kinds of pairs shows
the interaction, Pairs X Lists, to be
significant (F = 7.08, df = 1/62,
P < .01). Thus, the association norms
predict speed of learning for List S_l
but not for List S_h.
Errors during learning.—Misplaced
Rs were much more common during
the acquisition of List S_h than of List
S_l. Twenty-two of the Ss learning
List S_h made such errors, and only
6 of the Ss learning List S_l. When
the numbers of misplaced Rs are
expressed as percentages based on
opportunities, the mean percentage
for List S_h is 27.6 (SD = 20.0;
Median = 26.8). For List S_l the mean
of the extremely skewed distribution
is 6.4 (SD = 16.1; Median < 1.0).
The difference between the two dis-
tributions is significant beyond the
.01 level by Wilcoxon's test for signed
ranks, with Ss in the same block
treated as paired replicates.

A total of 23 importations from
outside the list was contributed by
13 of the Ss learning List S_l; 3 of the
Ss learning List S_h contributed a total
of five such errors. The difference
between the proportions of Ss making
such errors is significant beyond the
.01 level (χ² = 6.75; df = 1). All
of the importations appeared in the
association tables for the appropriate
stimuli. Of 23 importations into List
S_l, 19 were substitutions of primary
associates for prescribed A_l responses;
no such cases occurred during the
learning of List S_h.
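
The χ² reported above for the proportions of Ss making importations can be checked with a short sketch. Two assumptions are made that the paper does not state outright: each list was learned by 32 Ss (two groups of 16), and a Yates continuity correction was applied. Under those assumptions a 2 X 2 table with 13 vs. 3 Ss making importations reproduces the reported value.

```python
def yates_chi_square(a, b, c, d):
    """Chi-square for a 2 x 2 table [[a, b], [c, d]] with Yates correction."""
    n = a + b + c + d
    # Continuity correction: shrink |ad - bc| by n/2, never below zero.
    diff = max(abs(a * d - b * c) - n / 2.0, 0.0)
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * diff ** 2 / denom


# Rows: the two lists; columns: Ss with / without importations.
# The group size of 32 per list is an assumption (two groups of 16).
chi2 = yates_chi_square(13, 32 - 13, 3, 32 - 3)
print(round(chi2, 2))  # 6.75
```

Without the correction the same table gives about 8.33, so the match at 6.75 suggests, but does not prove, that the corrected statistic was the one reported.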

Recall.—The recall scores for the
four kinds of pairs are shown in Table
9. When we consider the two types
of pairs within each list, we find clearly
superior retention of the high-prob-
ability pairs in List S_l but no differ-
ence in List S_h. Comparison of the
corresponding pairs in the two lists
shows retention to be higher for Pairs
S_l-A_h than for Pairs S_h-A_h. By
contrast retention is poorer for Pairs
S_l-A_l than for Pairs S_h-A_l.

In view of the virtual absence of
any variation in the recall scores
on the 30-sec. test, the following
procedure was used to evaluate the
statistical significance of the differ-
ences in the amount of forgetting.


Within each block, the difference 
between the scores obtained by Ss in 
the 30-sec. and 7-day groups was 
determined for each type of pair, and 
the distribution of differences was
subjected to an analysis of variance.
The over-all difference between Lists 
is not significant (F < 1), whereas 
the interaction of Lists with Pairs is 
significant beyond the .01 level 
(F = 8.59, df = 1/30). The rela- 
tionship between the associative rank 
of the Rs and the rate of forgetting 
varies reliably with the word frequency 
of the Ss. 

Errors at recall.—The numbers of
misplaced Rs at recall were small and
did not increase during the retention
interval. The few intralist errors
that were made occurred almost
exclusively during the recall of List
S_h, at the rate of approximately .5
per S on both tests. By contrast,
importations from outside the list
showed a systematic temporal trend.
On the 30-sec. test there were only
2 Ss giving such Rs, both on Pairs
S_l-A_l. The frequencies of importa-
tions on the 7-day test and the num-
bers of Ss (N) contributing them were
as follows: S_h-A_h, 4 (N = 4); S_h-A_l,
5 (N = 5); S_l-A_h, 5 (N = 5); S_l-A_l,
19 (N = 13). When the 30-sec. and
7-day Ss assigned to the same block
are compared, the number of cases
in which there is an increase in

was 


TABLE 9 


MEAN NUMBERS OF PAIRS RECALLED IN
Exp. II


30-Sec. Recall 7-Day Recall 
Stimulus List Stimulus List 


Sp | Si 
[Figure 2: mean number correct on the relearning trials for Lists Hi-F, Med-F, and Lo-F; panel label: 7 days]

Fig. 2. Mean numbers of correct re-
sponses in relearning of verbal discrimination
(Exp. III).


importations during the retention
interval is significantly greater for
List S_l than for List S_h (χ² = 4.52,
df = 1, .02 < P < .05). It is clear
that this difference is primarily a
function of the number of imported
substitutions for low-probability as-
sociates. In the recall of List S_h,
none of these substitutions were
primary associates of the appropriate
Ss; in the case of List S_l, 16/19
substitutions were primary associates.

Relearning.—The mean numbers of
trials to relearn to a criterion of 5/5
on each type of pair and 10/10 for
the entire list are shown in Table 8.
The differences in the amount of
forgetting as measured by speed of
relearning correspond to those found
at recall. The procedure used in
analyzing the temporal trends in
recall was also applied to the relearn-
ing scores. The over-all difference
between Lists was again not reliable
(F = 1.53), whereas the interaction
of Lists with Pairs is significant
(F = 5.11, df = 1/30, .02 < P < .05).


EXPERIMENT III 
Method 


Materials.—The learning materials were 
three lists of 10 pairs of words each. One 
list (List HF) was composed of words of high 
frequency of usage, one list (List MF) of 
words of medium frequency, and one list 
(List LF) of words of low frequency. The 
words used in the construction of the lists 
were the same as those in the S lists and R 
lists of Exp. I (see Table 1). Two different 
pairings of the words in each list were used 
equally often. For purposes of verbal dis- 
crimination learning, one of the items in each 
pair was designated as correct. Each item 
in a pair was correct for half the Ss. 

Procedure.—The members of a pair were
presented one above the other in the window 
of a Hull-type memory drum. The rate of 
presentation was 1.5 sec. for the pair alone, 
and 1.5 sec. for the pair and the correct R 
together. The intertrial interval was 3 sec. 
Four different orders of presentation were 
used to minimize serial learning. In each 
order half of the correct words appeared in 
the upper position and half in the lower posi- 
tion. The position of the correct item in each 
pair varied from trial to trial. Each of the 
four orders was used as a starting order 
equally often. The Ss began anticipating the 
correct R on the first trial. Learning was to 
a criterion of one perfect trial. 

Retention was tested by relearning either 
30 sec. or 7 days after the end of practice. 
Relearning was for five trials or to criterion, 
whichever took the longer. 

Subjects.—With three types of lists and 
two retention intervals, there were six groups 
of 16 Ss each. All Ss were undergraduates 
who had English as their native language; 
they were not necessarily naive to rote-learn- 
ing experiments but had had no prior experi- 
ence with verbal discrimination learning. 
The Ss were assigned to conditions in blocks 
of 6, with 1 S per block for each combination 
of lists and retention intervals. The running 
order within each block and the assignment 
to different starting orders were determined 
by tables of random numbers. No Ss were
lost because of failure to learn.
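
The blocked assignment described above can be sketched as follows. This is an illustration only: a seeded pseudo-random generator stands in for the tables of random numbers, and the function name and condition labels are assumptions.

```python
import random
from itertools import product


def assign_blocks(n_blocks, lists, intervals, seed=None):
    """Return one randomly ordered set of conditions per block.

    Each block contains every list x interval combination exactly once,
    so each condition receives one S per block.
    """
    rng = random.Random(seed)
    conditions = list(product(lists, intervals))
    schedule = []
    for _ in range(n_blocks):
        order = conditions[:]  # copy so each block is shuffled independently
        rng.shuffle(order)
        schedule.append(order)
    return schedule


# Exp. III layout: 3 lists x 2 retention intervals, 16 blocks of 6 Ss.
schedule = assign_blocks(16, ["HF", "MF", "LF"], ["30 sec", "7 days"], seed=0)
```

With 16 blocks the sketch yields 96 assignments, 16 per condition, which matches the six groups of 16 Ss described in the text.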


Results 


Original learning.—There were no
significant differences between the
30-sec. and 7-day groups learning a
given kind of list. The mean num-
bers of trials to criterion for the com-
bined groups were as follows: HF, 6.4
(SD = 4.0); MF, 4.5 (SD = 1.5);
LF, 5.3 (SD = 2.5). The effects of
word frequency on acquisition parallel
those obtained on the S side in paired-
associate learning in Exp. I, i.e., List
MF is learned faster than Lists HF and
LF. The variation in criterion scores
is significant (F = 3.63, df = 2/93,
.02 < P < .05).

Recall and relearning.—Figure 2 
shows the mean numbers of correct 
Rs on the five trials of relearning 30 
sec. and 7 days after the end of prac- 
tice. The 30-sec. test reflects the 
rank order of the conditions in original
learning. Immediate relearning pro-
ceeds less steadily for List HF than
for the other two lists. The disad-
vantage of List HF becomes pro-
nounced on the 7-day test. A note-
worthy fact is that the difference
between List HF and the other two
lists increases between Trial 1 and
Trial 2 and that this separation is
maintained through Trial 4. The
conditions which slowed up the acqui-
sition of List HF rapidly come into
play again in relearning. The separa-
tion between Lists MF and LF is
consistently small.

The differences on the recall trial 
are not significant. For the five trials 
of relearning, the over-all variation 
among lists is significant (F 
df = 2/90, P < .01). With perform- 
ance on List HF poorest on both tests, 
the Time X Lists interaction falls 
short of significance (F = 2.12, 
df = 2/90). The pattern of differ- 
ences and the results of the statistical 
tests are the same when trials to 


relearn to criterion are considered. 




DISCUSSION 


The results of the experiments are 
consistent with the assumption that unit- 
sequence interference and unit-sequence 
facilitation covary as a function of word 


frequency, with the relative amount of 
effective interference increasing more 
rapidly on the S than on the R side. 
Such systematic changes in the pattern 
of transfer effects are indicated by the 
findings on acquisition in Exp. I: (a) 
rate of overt errors is a positively ac- 
celerated function of the word frequency 
of both Ss and Rs; (b) speed of learning 
increases steadily with the word fre- 
quency of Rs; (c) speed of learning first 
increases and then decreases as a func- 
tion of the word frequency of Ss. 

Speed of learning is influenced by the 
word frequency of both Ss and Rs where- 
as the amount of retention varies signifi- 
cantly only with S conditions. In acqui- 
sition the availability of the R units,
which is a function of frequency, is a 
major determinant of the speed of learn- 
ing (Underwood & Schulz, 1960). Once 
the Rs are available to the learner, the 
associates elicited by the Ss are of major 
importance in determining the speed at 
which the prescribed associative con- 
nections are established. Retention losses 
occur as incorrect associations which 
had been unlearned during acquisition 
recover in strength relative to the pre- 
scribed sequences. On the recall test, 
the competing Rs elicited by the S terms 
largely determine the amount of effec- 
tive interference. The rate at which 
such competing associations recover with 
the passage of time reflects the balance 
of positive and negative transfer effects 
in acquisition. There appears to be no 
differential decline in the availability 
of the Rs when well integrated units 
such as words are used. 

Experiment II shows that the pre-
experimental probabilities of specific
associates have a major influence on
learning and retention when the word
frequency of the Ss is low but have no
differential effects when the word fre-
quency of the Ss is high. In the acquisi-
tion of List S_h, strong associative con-
nections either existed or were readily
established by direct mediational links
between any pair of words in the list,
and the primary associates had no
advantage over associates of low rank.
In the case of List S_l, on the other hand,
the uncontrolled transfer effects of pre-
experimental habits were weak and did
not mask the difference between asso-
ciates of high and low probability.

The results of the retention tests in 
Exp. II clearly bring out the differences 
in unit-sequence interference and facili- 
tation falling on the pairs in the two 
lists. When facilitation by pre-experi-
mental habits obtains for the pairs in
both lists (S_h-A_h and S_l-A_h), retention
is inversely related to the word frequency
of the Ss. Under these conditions the
rate of forgetting directly reflects the
assumed difference in unit-sequence in-
terference. In the case of the low-prob-
ability pairs (S_h-A_l and S_l-A_l) there is
interference from language habits in
both lists, as indicated by the intralist
errors in List S_h and the frequent impor-
tation of primary associates into List S_l.
For these pairs, retention varies directly 
with word frequency, now reflecting the 
assumed difference in unit-sequence fa- 
cilitation. The question arises, of course, 
whether the differences in recall, and 
for that matter in acquisition, are simply 
the result of a set to give primary 
associates. While such a set was un- 
doubtedly present, the critical finding is 
that it was effective only with Ss of low 
word frequency. With Ss of high word 
frequency, no one associate is clearly 
dominant, and Rs from outside the list 
cannot compete effectively with those 
within the list. 

The fact that speed of verbal discrimi- 
nation learning first increases and then 
decreases as a function of word frequency 
(Exp. III) provides additional support 
for the assumed covariation of unit- 
sequence facilitation and interference. 
With homogeneous pairs such as were 
used in this study, differential effects 
of R availability are minimized (cf. 
Runquist & Freeman, 1960), and the 
relationship parallels that obtained for 
Ss in paired-associate learning. The 
availability of strong differential Rs is 
essential for the achievement of a stable 
discrimination, but once the number of 
such Rs exceeds an optimal value, gen- 
eralization across items counteracts the 
beneficial effects of R-produced differ-
entiation. It is likely that generalization
both within pairs and between pairs 
contributes to the total effect. While 
recall was too high to yield reliable dif- 
ferences, List HF is clearly inferior to the 
other two lists in relearning. 

In agreement with the results ob- 
tained earlier with serial lists (Postman, 
1961), the present experiments offer no 
support for the assumption that mean- 
ingfulness necessarily favors retention. 
By maximizing either the negative or 
positive transfer effects of pre-experi- 
mental habits, it is possible to obtain 
either an inverse or a direct relationship 
between meaningfulness and retention. 
For an average sample of verbal materi- 
als, the best prediction at present is 
that the differences in retention will 
be small when degree of original learning 
is held constant. 


SUMMARY 


Three experiments investigated the trans-
fer effects of language habits on the acquisi-
tion and retention of verbal associations.
They tested the assumption that both positive
and negative transfer effects (unit-sequence
facilitation and unit-sequence interference)
increase as a function of the frequency of usage
of words. The balance of unit-sequence 
interference and facilitation determines the 
speed of acquisition and the rate at which 
interferences recover with the passage of time. 

In Exp. I lists of paired associates repre- 
senting all possible combinations of S and R 
terms of high, medium, and low word fre- 
quency were used. Speed of acquisition first 
increased and then decreased as a function 
of the word frequency of Ss but varied directly 
with the word frequency of Rs. Amount of 
retention did not vary significantly as a 
function of R conditions whereas the effect 
of S frequency paralleled that obtained in 
acquisition. While the availability of Rs and 
pre-experimental associative probabilities both 
influence speed of acquisition, amount of 
retention loss appears to be determined pri- 
marily by the recovery of competing associa- 
tions elicited by the S terms. 

In Exp. II associates of high and low pre-
experimental probability, as determined by
norms of free association, were learned to Ss
of high and low word frequency. Pre-experi-
mental associative probability has significant
effects on learning and retention when the
word frequency of the Ss is low but not when
it is high. Whether or not manipulation of
the conditions of facilitation and interference
is successful depends on the magnitude of the
uncontrolled transfer effects of pre-experi-
mental habits.


The method of verbal discrimination learn- 
ing was used in Exp. III with pairs of words 
of either high, medium, or low word frequency. 
Speed of acquisition first increased and then 
decreased as a function of the word frequency 
of the paired items. Thus, the relationship 
paralleled that obtained for the word fre- 
quency of Ss in paired-associate learning. 
The differences in recall were unreliable, but 
the high-frequency lists were relearned more 
slowly than those of medium or low frequency. 
Kimble (1961) has recently shown
that the amplitude of the uncondi- 
tioned eyeblink is attenuated by the 
presence of the CS during reinforced 
CS-UCS conditioning trials. Assum- 
ing that this diminution of the UCR 
is a manifestation of a conditioned 
inhibitory process under the control 
of the CS, Kimble and Ost (1961) 
varied the CS-UCS interval during 
conditioning to determine whether
or not the degree of inhibition varied
as a function of the interstimulus
interval. They found that the strength
of the conditioned inhibitory process 
was related to the interstimulus inter- 
val in the same way as is the condi- 
tioned excitatory process, i.e., the 
greatest amount of recovery in UCR 
amplitude occasioned by the omission 
of the CS occurred when the interval 
during conditioning had been 0.5 sec. 

The purpose of the present study 
was to extend these eyelid condition- 
ing findings to another classically 
conditionable human response, the 
GSR, and to investigate the relation- 
ship between this conditioned inhibi- 
tory process and another parameter 
of the conditioning process, the num- 
ber of reinforcements. It was hy- 
pothesized that attenuation of the 
UCR during reinforcement and re- 
covery of the UCR upon omission of 


as 


1 The junior author’s collaboration in this 
,UTION OF THE UNCONDITIONED
THE NUMBER 
REINFORCEMENTS! 


PENNYPACKER 


Duke University 


the CS would both vary positively 
with variations in the number of 
reinforcements. 


METHOD 


Subjects.—Thirty students in introductory 
psychology at Duke University were Ss. 
The Ss volunteered to meet a class require- 
ment. They were assigned randomly to 
three groups of 10 each, to receive different 
numbers of reinforcements, but were other- 
wise treated identically.

Instructions.—The S read the instructions 
from a typewritten card. They indicated 
that this was an experiment on the effect of 
environmental stimulation upon the GSR 
and that S’s task was to remain relaxed and 
motionless but to pay attention to the stimuli 
during the experiment. 

Apparatus.—The CS was a 1000-cps tone
produced by General Radio Company equip-
ment and delivered to S via Trimm Company
earphones. The intensity of the tone, rated
at the earphones, was 40 db. (re: .0002
dynes/cm²).

The UCS was an electric shock produced 
by a Psychological Instruments Company 
stimulator and delivered to the index and 
middle fingertips of S’s right hand. The 
shock intensity used during conditioning 
was 2 ma. at 50 v., for a hypothetical S with 
a resistance of 25,000 ohms. 
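
The stated shock values are tied together by Ohm's law, as the short sketch below shows. The function name is hypothetical, and the second call assumes the stimulator held voltage constant, which the text does not say.

```python
def current_ma(volts, ohms):
    """Current in milliamperes from Ohm's law, I = V / R."""
    return 1000.0 * volts / ohms


# The "hypothetical S" of the text: 50 v. across 25,000 ohms.
nominal = current_ma(50, 25_000)  # 2.0 ma., as stated

# Assumption for illustration: at a fixed 50 v., an S with twice the
# resistance would receive half the current.
higher_resistance = current_ma(50, 50_000)
```

This is why the intensity is quoted for a hypothetical S: the delivered current depends on each S's actual skin resistance.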

The durations of the stimuli were con- 
trolled electronically. The CS had a duration 
of 1.0 sec. and the UCS a duration of 0.1 sec. 
On paired conditioning trials a delayed 
paradigm was used with a 0.9-sec. interval 
between onsets of the two stimuli. 

The GSR was measured as the maximum
decrease in resistance which occurred within
3 sec. after the offset of a stimulus. The
response was picked up from the palm and
back of S's left hand by }-in. zinc electrodes,
covered with a few drops of zinc sulphate
solution, in lucite cups filled with saline elec-
trode jelly. It was amplified by a Hunter
GSR apparatus and recorded on an Esterline-
Angus ink-writing milliammeter with a paper
speed of 3 in/min. All responses were trans-
formed to the square root of conductance
change (√ΔC) for statistical purposes.
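
The transformation from a resistance drop to the square root of conductance change can be sketched as below. The unit (micromhos) and the function name are assumptions, and the resistance values are hypothetical rather than taken from the study.

```python
import math


def sqrt_conductance_change(r_before_ohms, r_after_ohms):
    """Square root of the rise in conductance, in micromhos (assumed unit).

    Conductance is the reciprocal of resistance, so a decrease in
    resistance appears as an increase in conductance.
    """
    c_before = 1e6 / r_before_ohms
    c_after = 1e6 / r_after_ohms
    delta = c_after - c_before
    if delta < 0:
        raise ValueError("resistance rose; no GSR in the expected direction")
    return math.sqrt(delta)


# Hypothetical response: resistance falls from 25,000 to 20,000 ohms,
# i.e., conductance rises from 40 to 50 micromhos.
score = sqrt_conductance_change(25_000, 20_000)
```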

Procedure.—Data were collected in a quiet 
dark room. An electric fan masked extrane- 
ous sounds. The E and the apparatus were 
located in an adjoining room. 

The S’s left hand was cleaned with acetone 
before the GSR electrodes were attached. 
He was then seated in the experimental room 
and given the instructions to read before the 
pickup leads were attached to the GSR elec- 
trodes, the shock electrodes were taped to his 
fingertips, and the earphones were placed 
on his head. 

The S was first given 2 presentations of 
the CS alone followed by 3 presentations of 
the shock alone, in intensity increments up to 
the intensity used during conditioning. This 
was followed by 4 additional presentations 
of the CS alone. During the reinforcement 
period, which occurred next, the three groups 
of Ss received either 4, 8, or 16 paired presen- 
tations of the CS and UCS. After reinforce- 
ment 2 presentations of the shock alone were 
given and, finally, 4 presentations of the CS 
alone (i.e., extinction). All presentations 
(i.e., trials) were separated by intervals of 
30 to 60 sec., varied unsystematically by E. 


RESULTS 


Of primary significance in evaluat- 
ing the hypotheses was the way in 
which the amplitude of the UCR 
varied over trials during reinforcement 
and the degree to which an increment 
in amplitude of the UCR was pro- 
duced by the omission of the CS after 
reinforcement. Figure 1 shows the 
average amplitude of GSR for each 
group of Ss, as a function of reinforced 
trials. Also indicated in Fig. 1 are 
the average amplitudes of GSR on the 
two test trials of the UCS only (shown 
in the figure). Each data point repre- 
sents the mean of 10 Ss. 


Fig. 1. Average amplitude of the un-
conditioned GSR during reinforcement and
on test trials without the CS. (Abscissa: reinforcements.)


Figure 1 indicates that the ampli-
tude of the UCR gradually reduced
during reinforcement, although the
groups differed in the extent to which
this reduction occurred. Statistical
analysis of the degree of diminution
of GSR amplitude was achieved by
comparing the average amplitude of
GSR on the first two reinforced trials
(combined) with the average ampli-
tude of GSR on the last two reinforced
trials (combined). Thus, for the
group which received four reinforce-
ments the two values used were the
average of the first two reinforced
trials and the average of the third and
fourth reinforced trials. The values
used for the group which received
eight reinforcements were obtained
from the first two reinforced trials and
the seventh and eighth reinforced
trials, etc. Analysis of variance of
these average measures of UCR ampli-
tude during reinforcement showed
that the overall reduction in UCR
amplitude was statistically significant
(P < .05, error variance = .2616) but
that it did not interact significantly
with the number of reinforcements as
had been expected.

Figure 1 also shows the degree to 
which the omission of the CS pro- 
duced an increment in UCR ampli- 
tude. The exact quantitative nature 
of the relationship between the num- 
ber of reinforcements and the degree 
of increment produced by the omis- 
sion of the CS is shown in Fig. 2, 
which presents the average difference 
(i.e., increment) in GSR amplitude 
between the last two reinforced trials 
(combined) and the two test trials 
of UCS only (combined). A logarithmic
scale has been used for the
abscissa in Fig. 2 to emphasize the 
linearity of the relationship when 
expressed in this way. 

It is clear in Fig. 2 that the degree 
to which the omission of the CS pro- 
duced an increment in UCR amplitude 
increased as a linear function of the 
logarithm of the number of reinforce- 
ments, i.e., as a negatively accelerated 
growth curve. The analysis of vari- 
ance of the averaged UCRs on the 
last two reinforced trials and the two 
UCS-only trials indicated that the 
overall increment in UCR amplitude 
produced by the omission of the CS 
was statistically significant (P <.025) 
and that the interaction between this 
increment and the number of rein- 
forcements approached significance 
(P < .07). The linear component of 
the trend over log N was statistically 
significant (P < .025). Error vari- 
ance for these tests = .1331. 
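
A linear trend over log N can be sketched as a least-squares line fitted to group means plotted against the logarithm of the number of reinforcements. The function below is an illustrative assumption, not the analysis of variance actually reported, and the report gives no numerical group means, so none are supplied here.

```python
import math


def linear_trend_over_log_n(ns, means):
    """Least-squares slope and intercept of group means on log2(N)."""
    if len(ns) != len(means) or len(ns) < 2:
        raise ValueError("need one mean per group and at least two groups")
    xs = [math.log2(n) for n in ns]
    x_bar = sum(xs) / len(xs)
    y_bar = sum(means) / len(means)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, means))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


# With N = 4, 8, 16 the abscissa values are 2, 3, 4, i.e., equally spaced,
# which is why a logarithmic abscissa makes a log-linear relation look straight.
print([math.log2(n) for n in (4, 8, 16)])
```

Because each doubling of N adds a constant amount under this relation, equal numbers of added reinforcements yield progressively smaller increments, which is the negatively accelerated growth described in the text.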

It was of interest also to examine 
the relationship between the number 
of reinforcements and GSR amplitude 
during extinction. This evaluation 
was contaminated in part by the fact 
that all of the Ss had received two

Fig. 2. The relationship between the
number of reinforcements and the amount 
of increment produced by the omission of 
the CS. 


presentations of the UCS alone be- 
tween the end of reinforcement and 
the beginning of extinction, which 
probably tended to obscure any dif- 
ferences in degree of conditioning 
which might have occurred. For this 
reason only the last three extinction 
trials were used to evaluate the degree 
of conditioning which occurred. When 
the average amplitude of GSR on 
these three trials was compared to 
the average amplitude of the GSR on 
the last three presentations of the CS 
alone prior to conditioning it was 
found that the average amplitude 
of GSR increased greatly from before 
to after the reinforcement period and 
that this increase was largest in the 
group which received eight reinforce- 
ments. The overall difference from 
before to after reinforcement was 
highly significant, supporting the con- 
clusion that conditioning occurred, 
but the interaction between the 
number of reinforcements and degree 
of conditioning failed to reach 
significance. 


DISCUSSION 


The results of this experiment sup- 
port the notion that the CS becomes 
capable during conditioning of attenuat- 
ing the amplitude of the UCR. Not 


only did the amplitude of the UCR 
diminish during reinforcement, but, when 
the CS was omitted and the UCS pre- 
sented alone, the UCR recovered to 
approximately its original amplitude. 

It is of little significance to point out 
that S was surprised to receive the UCS 
without the CS and that this surprise, 
acting as a startle stimulus, may have 
been responsible for the increment in 
the GSR on shock-alone trials. The 
fact that the presence of the CS serves as 
a warning to S that the shock is coming 
is, of course, one of the things that are 
“learned” during the conditioning proc- 
ess. Even when viewed in this way, 
the reduction-of-surprise function of
the CS is essentially an inhibitory function.
That this inhibitory aspect of the
classical conditioning process is highly 
adaptive to the organism being condi- 
tioned is apparent, since a good deal of 
unnecessary emotional overresponding 
is brought under control or avoided by 
the association of the warning signal 
(i.e., the CS) with the noxious event 
which is imminent. 

The relationship between the present 
findings and those of an earlier study 
of disinhibition in GSR conditioning
(Kimmel & Fowler, 1961) is of some 
significance. In the earlier study it was 
shown that the addition of an extra, or 
disinhibiting, stimulus to the CS after 
reinforcement had the effect of producing 
an increment in the amplitude of the 
conditioned GSR and that this effect 
was positively related to the number of 
reinforcements given prior to the addi- 
tion of the extra stimulus. The inter- 
pretation of the results of the earlier 
study was based upon the assumption 
that the CS acquired inhibitory capa- 
bilities during reinforcement and that 
the extra stimulus inhibited this in- 
hibition, producing an increment in 
response amplitude, i.e., disinhibition. 
The present findings support this inter- 
pretation, since the omission of the CS 
had the same behavioral consequence 
as the inhibition of the inhibition con- 
trolled by the CS, i.e., in both cases 
the inhibition was removed or reduced. 
The observed positive relationship be- 
tween these effects and the number of 
reinforcements supports the contention 
that they result from an associative 
process as, of course, do the findings of 
Kimble and Ost (1961) that the con- 
ventional interstimulus interval function 
is involved. 


SUMMARY 


This study tested the hypotheses that 
the amplitude of the unconditioned GSR is 
gradually attenuated by the CS during con- 
ditioning and that this reduction in UCR 
amplitude vanishes when the CS is omitted 
and the UCS presented alone. Three groups 
of 10 Ss received either 4, 8, or 16 reinforced 
presentations of a tone CS and electric shock 
UCS in a classical delayed paradigm. Fol- 
lowing reinforcement the CS was omitted and 
2 UCS alone trials were administered. After 
the tests on the shock alone, 4 extinction 
trials of the CS alone were given. 

It was found that the amplitude of the 
unconditioned GSR diminished during rein- 
forcement and that the UCR recovered to 
approximately its original amplitude when 
the CS was omitted. The amount of incre- 
ment produced by omitting the CS was a 
linear function of the logarithm of the number 
of reinforcements. These results were inter- 
preted as supporting the notion that a condi- 
tioned inhibitory process develops during 
reinforcement, under the control of the CS, 
which attenuates the amplitude of the UCR 
in the presence of the CS. The relationship 
between these findings and similar findings 
in a study of the relationship between amount 
of disinhibition and number of reinforcements 
was thought to add further support to the 
interpretation offered.

THE EFFECTS OF DRIVE AND DISCRIMINATION
TRAINING ON STIMULUS GENERALIZATION 1


DAVID R. THOMAS 2


Duke University


In 1959 Hanson reported in this 
journal a study on the influence of 
discrimination training on stimulus 
generalization. Hanson trained pi- 
geons to peck at a key illuminated by 
a monochromatic light of 550 mμ,
under a VI reinforcement schedule.
After successive discrimination train- 
ing, Ss were subjected to a generaliza- 
tion test in extinction, following a 
procedure introduced by Guttman 
and Kalish (1956). 

Hanson found that time to the
discrimination criterion varied as a
negatively accelerated decreasing function
of the (S+, S−) difference.
Postdiscrimination generalization gradients
(PDGs) were higher and steeper
than a control gradient and, in addition,
showed a shift in the peak of
responding from the S+ in the direction
away from the S−. The extent
of this shift varied also as a negatively
accelerated decreasing function of
the (S+, S−) difference.

The present study is an extension 
of Hanson’s work. The Ss were 
tested on three different discrimina- 
tion problems at three different levels 
of food deprivation. The literature 
is inconsistent with regard to the 

1 This study was taken from a dissertation
submitted to the Psychology Department of
Duke University in partial fulfillment of the
requirements for the PhD degree in psychology.
The writer is greatly indebted to
Norman Guttman under whose direction the
investigation was conducted. Thanks are
also due to Doris Homa Thomas for running
some of the Ss. At the time the research was
conducted the author was a Predoctoral
Research Fellow of the United States Public
Health Service.

2 Now at Kent State University.


influence of drive level on discrimina- 
tion learning, and no study appears 
to have been reported using the 
operant free-responding technique. 
In this study, Ss were tested for 
generalization before discrimination 
training was initiated, and in addition, 
short generalization tests were inter- 
spersed during the course of discrimin- 
ation training so as to trace the 


development of the changes in the 
PDG. 


METHOD 


Subjects—The Ss were 54 experimentally 
naive white carneau pigeons obtained from 
the Palmetto Pigeon Plant in Sumpter, 
South Carolina. 

Apparatus—A bank of four identical 
Skinner type key pecking apparatuses was 
used. Each box had the following internal 
dimensions: width, 154 in.; depth, 14} in.; 
height, 14} in. 


Walls and ceiling were painted 
flat black; floors were of unpainted Masonite. 
The S’s key of translucent plastic was exposed 
through a j-in. circular aperture placed 64 in. 


above the floor on one side of the box. Di- 
rectly below the key in the floor of each box 
was a 1-in. circular aperture through which 
Ss had access to the food magazine. The 
magazine, which was operated by a motor- 
driven cam, allowed S approximately 4 sec. 
of access to food during each cycle. Between 
cycles, the food was lowered beyond S’s reach. 
A Plexiglas light fixture with a 15-w. bulb, 
placed directly above and in front of the floor 
opening illuminated the opening and the 
magazine for the duration of each cycle. 
Aside from the stimulus light and the 
magazine light during magazine cycles, the 
boxes were in darkness throughout the experiment.
One box, Number 1, was set aside
for generalization testing. The source of
illumination for the key in this box was a
Bausch and Lomb diffraction grating
monochromator, Model 33-86-40, equipped with a
108-w., 6-v., ac ribbon filament tungsten lamp.
Boxes 2, 3, and 4 were used for training
purposes only. The keys of these were
illuminated by a 100-w., 120-v., ac projection
lamp. Positive and negative stimuli were
provided by Bausch and Lomb interference 
filters. Each box was equipped with a “positive
filter” (transmission peak 550 mμ) and a
“negative filter” (590 mμ for Box 2, 570 mμ
for Box 3, and 558 mμ for Box 4). The brightness
of the filter colors was matched to the
monochromator colors by the addition to the 
filters of an appropriate number of pieces 
of exposed film (Polaroid No. D 77843). 
The matching was accomplished with the aid 
of a photomultiplier tube and its associated 
amplifier. 

The three training boxes were set up to 
change stimuli automatically according to a 
prearranged schedule punched on a tape run 
on a Gerbrands programer. Throughout the 
entire experiment, masking noises were sup- 
plied to all boxes by a Grason-Stadler noise 
generator, Model 901. 

Procedure.—Upon arrival at the laboratory,
all Ss were weighed, individually caged,
and allowed free access to food and water.
Throughout the entire experiment free access
to water was always available in the home
cages. After 4 to 10 days of free feeding, a
stable weight level was achieved by all Ss
and food deprivation was begun. The Ss
were randomly assigned to three body weight
levels, 80% of ad lib. weight, 70%, and 60%,
and to the three different discrimination
problems, thus creating nine groups of Ss,
one for each combination of weight level and
discrimination problem. Deprivation ceased
for each S when the appropriate weight was
reached. At this point training was begun.

All Ss were adapted, magazine trained, 
and conditioned to key peck according to a 
set schedule covering 5 days. During this 
time, the box was in complete darkness with 
the exception of the key light and the maga- 
zine light during reinforcement cycles. 

All Ss were given VI reinforcement for 5
days following conditioning. Thirty 60-sec.
stimulus-on periods, alternated with 10-sec.
stimulus-off periods, were given each day.
The mean interval between reinforcements,
not counting stimulus-off periods, was
approximately 60 sec., with a range from 4 sec.
to 4 min. During stimulus-off periods, a
shutter operated, removing the light from
the key and leaving the experimental box
dark. Reinforcements were never given
during a stimulus-off period. One of the five
VI sessions for each S was administered in
Box 1, the test box, so as to accustom Ss
to the test box and to the
monochromator-produced stimulus.

On the next day, after a 5-min. warm-up 
period of VI in the test box, each S was sub- 
jected to a generalization test in extinction. 
Eleven different test stimuli were used:
510, 520, 530, 540, 545, 550, 555, 560, 570,
580, and 590 mμ. The 11 test stimuli were
randomized within a series and six different 
random series were presented to each S. 
This resulted in a schedule of 66 stimulus 
presentations. Each stimulus presentation 
was for 30 sec. and was followed by a 10-sec. 
blackout period, during which the number 
of responses in the preceding period was 
recorded and the stimulus was changed. 
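
The construction of the test schedule can be sketched as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, and a pseudorandom shuffle stands in for whatever randomization method was actually used.

```python
import random

# The 11 test wavelengths (in millimicrons) listed in the text.
TEST_STIMULI_MMU = [510, 520, 530, 540, 545, 550, 555, 560, 570, 580, 590]


def generalization_schedule(n_series, rng):
    """Concatenate n_series independent random orders of the 11 stimuli."""
    schedule = []
    for _ in range(n_series):
        series = TEST_STIMULI_MMU[:]
        rng.shuffle(series)  # each stimulus appears exactly once per series
        schedule.extend(series)
    return schedule


def session_seconds(n_presentations, on_sec=30, blackout_sec=10):
    """Total time if every presentation is followed by a blackout period."""
    return n_presentations * (on_sec + blackout_sec)


schedule = generalization_schedule(6, random.Random(0))
print(len(schedule))                   # 66 presentations
print(session_seconds(len(schedule)))  # 2640 sec., i.e., 44 min.
```

Randomizing within each series, rather than over all 66 presentations, guarantees that every wavelength is sampled equally often in each part of the extinction test.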

On the day following the (preliminary) 
generalization test, discrimination training 
was begun. All three discrimination groups 
were trained with the same positive stimulus 
(S+), 550 mμ, and differed only with respect
to the negative stimulus (S−). Each
discrimination group consisted of 18 Ss, 6 at each
of the three body weight levels. These groups
may be designated by the negative stimulus
used: 590 mμ (trained in Box 2), 570 mμ
(Box 3), and 558 mμ (Box 4). During
discrimination training, responding to the 
positive stimulus was reinforced according 
to the same VI schedule previously used. 
Responding to the negative stimulus was 
never reinforced. The positive and negative 
stimuli were presented successively in a pre- 
arranged random order. Fifteen 1-min. 
intervals of S+ and 15 of S— were presented 
each day. All stimulus changes were made 
during the 10-sec. blackout periods. The 30 
stimulus presentations comprised three blocks 
of 10, and within each block there were 5 
positive and 5 negative stimuli. Discrimina- 
tion training was continued until a criterion 
of no responding in five successive periods of 
S— combined with continued responding to 
S+ was achieved. 
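
The daily stimulus order and the learning criterion can be sketched as below. The names are hypothetical, and one point is an assumption: the text does not define "continued responding to S+," so the sketch takes it to mean at least one response in every S+ period falling between the first and last of the five S− periods.

```python
import random


def daily_order(rng):
    """Three blocks of 10 periods, each block holding 5 S+ and 5 S- at random."""
    order = []
    for _ in range(3):
        block = ["S+"] * 5 + ["S-"] * 5
        rng.shuffle(block)
        order.extend(block)
    return order


def criterion_met(periods, run_length=5):
    """periods: list of (stimulus, response_count) in presentation order."""
    s_minus_idx = [i for i, (s, _) in enumerate(periods) if s == "S-"]
    for start in range(len(s_minus_idx) - run_length + 1):
        window = s_minus_idx[start:start + run_length]
        if any(periods[i][1] != 0 for i in window):
            continue  # some S- period in this run drew responses
        spanned = periods[window[0]:window[-1] + 1]
        # Assumed reading of "continued responding to S+" (see lead-in).
        if all(r > 0 for s, r in spanned if s == "S+"):
            return True
    return False


order = daily_order(random.Random(1))
print(order.count("S+"), order.count("S-"))  # 15 and 15
```

Blocking the randomization keeps S+ and S− equally frequent within every third of the session, so long unbroken runs of either stimulus are limited.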

During the course of discrimination train- 
ing, a three-series generalization test of 33 
stimulus presentations was administered at 
the completion of every even numbered daily 
session of discrimination training. If the 
discrimination criterion was met on a day 
on which an interpolated generalization test 
was scheduled, the test was omitted. On the 
day following the achieving of the criterion 
for discrimination, a final generalization test 
was administered to each S. The final test 
was carried out in the same manner as the 
preliminary and interpolated tests, with the 
exception that 12 test series (132 stimulus 
presentations) were used.

Fig. 1. Postdiscrimination generalization
gradients after three different discrimination 
problems, pooled over three drive levels. 


RESULTS AND DISCUSSION 


Data bearing on three different 
problems will be presented in this 
report. The first problem concerns 
the roles of hunger drive level and 
(S+, S—) difference in determining 
the rate of formation of a discrimination.
The present experiment is fully
in agreement with Hanson (1959) in
the finding that the time required to form
a discrimination varies inversely with the
physical difference between the stimuli to
be discriminated. Analysis of variance indicates
that minutes to criterion varies as a
function of the (S+, S−) difference
(F = 23.50, df = 2/45, P < .01). A
beneficial influence of drive level is
suggested, but is not demonstrated
at a statistically acceptable level
of confidence (F = 2.96, df = 2/45,
.05 < P < .10). The results seem
sufficiently promising, however, to
warrant a more direct attack on the
problem, without the interpolation
(as in the present case) of other
experimental treatments (preliminary
and interpolated generalization tests).
The interaction between the (S+, S−)
difference and drive level is not
significant (F < 1, df = 4/45).

In the present experiment deprivation
levels down to 60% of ad lib.
weight were employed. Thus, the
limits of the hunger motive were
assessed. We may therefore conclude
that even at its extremes, the hunger
drive exercises far less control over
the rate of discrimination learning
than does the physical difference
between the stimuli to be discriminated.

The second problem to be con- 
sidered here concerns the roles of drive 
level and the (S+, S—) difference in 
determining the location of the PDGs. 
In Fig. 1 are presented the PDGs of 
the three problem groups pooled over 
all drive levels. The PDGs show
the characteristic displacement of the
peak from the S+ by an amount inversely
related to the (S+, S−) difference.
A convenient measure of
the location of a gradient is derived
by treating it as a grouped frequency
distribution and computing the mean
value. The difference between the
means of the 18 individual mean
scores in the three problem groups is
significant (F = 10.74, df = 2/45,
P < .01). The effect of the (S+, S−)
difference on the location of the PDG
is mirrored at all three levels of drive.
However, the effect of drive level on
the means of the final generalization
gradients fails to achieve a statistically
acceptable level of confidence
(F = 2.56, df = 2/45, .05 < P < .10).
Nor is the interaction between
the (S+, S−) difference and drive
level significant (F = 1.07, df = 4/45).
Thus, Hanson’s findings with regard 
to the effect of the (S+,S—) difference 
on the location of the central tendency 
of the PDG are replicated at three 
different levels of drive. 
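
The location measure described above, the mean of a gradient treated as a grouped frequency distribution, amounts to a response-weighted mean wavelength. The sketch below is illustrative only; the function name is hypothetical and no response counts from the study are reproduced.

```python
def gradient_mean(responses_by_wavelength):
    """Mean wavelength of a gradient, weighting each stimulus by its responses.

    responses_by_wavelength maps a test wavelength (in millimicrons) to the
    total number of responses made in its presence.
    """
    total = sum(responses_by_wavelength.values())
    if total == 0:
        raise ValueError("no responses were recorded, so the mean is undefined")
    weighted = sum(w * n for w, n in responses_by_wavelength.items())
    return weighted / total
```

A gradient that is symmetrical about 550 mμ has a mean of 550 mμ under this measure, and a shift of responding away from S− lowers the mean when S− lies on the long-wavelength side, as it did in all three problems.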

In agreement with Hanson, we have 
found that both the time to criterion 
and the amount of displacement of the 
PDG are inversely related to the 
(S+, S−) difference. This suggests
the possibility that amount of training
is the vehicle through which the
(S+, S−) difference has its effect on
the PDG. The analysis of this question
was the third major problem to
which the present study addressed itself.
The administration of a short
generalization test to all Ss after 2 days
of training (with the exception of
those Ss which had already met the
learning criterion) makes possible an
evaluation of the effects of the (S+, S−)
difference, with amount of training
held constant. On the other hand,
the analysis of the series of generalization
tests given to all Ss during the
course of discrimination training reveals
the role of amount of training
with the (S+, S−) difference controlled.

Although only 1 S from the 590-my 
group failed to achieve the criterion 
of discrimination learning within 
2 days, 14 Ss from the 570-my group 
and 16 Ss from the 558-my group 
could be compared in this manner. 
For each S a “displacement score”’ 


TABLE 1

DISPLACEMENT OF THE MEAN OF THE GENERALIZATION
GRADIENT WITH TWO SESSIONS OF DISCRIMINATION
TRAINING AT TWO DIFFERENT (S+, S−) DIFFERENCES


Fig. 2. Mean of the gradient as a function of
ordinal position of test.


was obtained by subtracting the mean 
of the second generalization test from 
the mean of the first. The means of 
these displacement scores for the two 
groups were virtually identical, 5.6
mμ for the 570-mμ group vs. 5.7 mμ
for the 558-mμ group. The mean
displacement is the same in spite of 
the fact that in one case the (S+,S—) 
difference is more than twice that of 
the other! 
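
The arithmetic behind this comparison can be written out as a short sketch. The function names are hypothetical; the wavelengths are those given in the text, and no individual gradient means are reproduced.

```python
S_PLUS_MMU = 550  # positive stimulus, in millimicrons


def s_difference(s_minus_mmu):
    """Physical (S+, S-) difference in millimicrons."""
    return abs(s_minus_mmu - S_PLUS_MMU)


def displacement_score(mean_first_test, mean_second_test):
    """Mean of the first generalization test minus mean of the second."""
    return mean_first_test - mean_second_test


# 20 mμ against 8 mμ: the larger difference is 2.5 times the smaller.
print(s_difference(570) / s_difference(558))
```

With S− on the long-wavelength side, a positive score indicates that the mean of the gradient moved toward shorter wavelengths, that is, away from S−.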

In Fig. 2 are presented the means 
of the preliminary generalization gra- 
dients obtained from all Ss, the means 
of the second obtained gradient, the 
third, etc. The data from all groups 
have been pooled since it appears that 
neither the (S+, S—) difference nor 
drive level have any marked effect 
on the location of the PDG if the 
amount of training is held constant. 
Under each of the values is recorded 
the number of Ss whose scores are 
represented in that mean value. It 
should be noted that the points 
plotted were not obtained from different
groups of Ss. On the contrary,
the 5 Ss which received five
generalization tests were included
among the 22 Ss which received (at
least) four generalization tests, which,
in turn, were among the 32 Ss which
received (at least) three generalization
tests, etc. The value of N continually
decreases, indicating that all
Ss had at least two generalization tests
whereas progressively fewer Ss had
three tests, four tests, etc. The figure
shows a negatively accelerated
decreasing function. Even as few as two
sessions of discrimination training
produce a marked shift in the location
of the gradient.


The present procedure has made
possible the separation of the effects of
the (S+, S−) difference and amount of
training on the location of the PDG.
When this is done it appears that the
effects of the (S+, S−) difference are
mediated through the amount of training.
There is no obvious explanation of the
failure of the (S+, S−) difference to have
a direct effect on the location of the PDG.
It is clear, however, that this finding
raises serious doubt about the adequacy
of a Spence-type theory of inhibition
(Spence, 1936, 1937) to account for the
process of successive discrimination learning
in the operant free-responding situation.
It is to be hoped that future
studies will suggest an alternative
theoretical framework which will have greater
predictive and explanatory value within
this context.


SUMMARY 


The effects of drive and discrimination
training on stimulus generalization were
studied in the pigeon. Three groups of 18 Ss
each, maintained at 60%, 70%, and 80% of
ad lib. weight, respectively, were trained to
respond to a key illuminated by a light of
550 mμ. After 5 days of VI training, Ss were
tested for generalization to stimuli ranging
from 500 mμ to 600 mμ. Then the three groups
were subdivided into three discrimination
problem groups, 550 mμ positive for each, but
with 590 mμ, 570 mμ, and 558 mμ negative,
respectively. During discrimination training,
responding to S+ was VI reinforced; responding
to S− was never reinforced. Periodically
during the course of discrimination training
short generalization tests were given to all
Ss. After the criterion of discrimination was
met, all Ss were subjected to a final
generalization test.

The major conclusions were: (a) minutes to 
criterion varies inversely with the (S+, S—) 
difference; a beneficial effect of drive is sug- 
gested but not conclusively demonstrated 
(.05 < P < .10). (b) Discrimination training
produces a general steepening of the PDG, a 
lowering of the gradient in the region of S—, 
and a shift of the central tendency from the 
region of S+ in the direction away from S—, 
the amount of shift varying inversely with 
the (S+, S—) difference. This finding is 
replicated for all three levels of drive. (c)
Generalization gradients obtained during the 
course of discrimination training reveal that 
the mean of the gradient shifts in a negatively 
accelerated manner as a function of the 
amount of discrimination training. The 
amount of training appears to be the vehicle 
through which the effect of the (S+,S—) dif- 
ference on the location of the PDG is medi- 
ated. Displacement varies with amount of 
discrimination training, independent of any 
direct effect of the (S+, S−) difference.

THE LEARNING OF RESPONSES TO MULTIPLE
WEIGHTED CUES 1


STANLEY


University of California, Los Angeles


In many situations a response is
dependent simultaneously on several
different cues, i.e., on several potentially
useful stimulus attributes. The
purpose of this experiment was to
analyze the learning of responses to
simultaneously presented cues of different
validities, in order to determine
how much the responses came
to depend on each cue. The implications
are relevant especially to situations
in which individuals react to
several influences in reaching a judgment,
decision, or interpretation.


Questions about the effects of complex
stimuli have been answered by several
related types of experimental analysis,
such as investigations of stimulus-compound
patterning, of probability learning,
and of the utilization of multiple
cues. In studies of cue utilization,
correlations may be taken between each
of several cues and the response variable.
This correlational approach, which is
particularly associated with the work
of Brunswik (1956), was used in the
present experiment. Brunswik developed
the concepts of ecological validity
and functional validity. Ecological
validity is the correlation between the cue
and an environmental variable predicted
by it. Functional validity is the
correlation between the cue and a
response, such as a judgment, directed
toward the environmental variable.
Stimulus attributes, then, can be dealt
with as joint cues to some criterion
variable. A correlational approach to cue
utilization provides a method of abstracting
from a series of responses the
simultaneous effects of several cues.

Brunswik assumed that the functional
validity of a cue will correspond, in most
cases, to its ecological validity; the extent
to which each cue is used should come,
through learning, to conform to the extent
to which the cue predicts. With
correlational analysis Brunswik investigated,
for the most part, established
patterns of cue utilization, rather than
the learning process by which individuals
acquire such patterns. He attempted to
ascertain the use of cues in settings
representative of the natural environment.
To isolate the effect of cue validity,
however, it seems desirable to establish
ecological validities for originally
meaningless cues in a laboratory setting and to
observe the responses learned to these
cues.

An attempt to study the learning of
functional validities in a controlled
situation was made by R. Goodnow
(Bruner, Goodnow, & Austin, 1956).
The validity of a cue for predicting a
correct category was defined as the relative
frequency of the cue's association
with that category. The results did not
indicate any close correspondence between
cue use and objective validity.
However, there are limitations in the
treatment of this question in terms of
frequencies rather than correlations.
One limitation is that probability matching,
which was considered the criterion
of appropriate functional validity, would
not have maximized success in the situation
and, therefore, did not have the
adaptive character of a match of functional
to ecological validity in a correlational
framework. Smedslund (1955),
dealing less directly with the relation
between ecological and functional validity,
approached the topic of cue learning
through a correlational framework.
He established joint visual cues to a
predicted variable. A correlation was
imposed between each of the cues and
the correct response, giving each cue a
different validity. The main conclusion
was that Ss do learn “to utilize many
probabilistic cues simultaneously”
(Smedslund, 1955, p. 26). In Smedslund's
design, however, a particular cue
was always associated with the same
validity, so that the effect of validity
could not be studied independent of the
effects of saliency or other cue
characteristics.


Three stimulus forms.


In the present study, correlations 
were imposed between each of three 
simultaneously presented visual cues 
and a variable, correct line length, 
whose magnitude varied with the 
magnitude of all three cues. Each 
of the experimentally imposed correla- 
tions will be called a cue weighting. 
Correlations were taken between cor- 
rect line length and S’s response 
throughout a learning session, and 
between each of the individual cues 


and S’s response. These empirically 


derived correlations will be called 
response weightings. 

It was expected that the order of 
utilization of the three cues would 
come to conform to the order of cue 
validity. Specifically, it was pre- 
dicted that the response weightings 
would be different and would be 
ranked in the order of the cue weight- 
ings, and that the magnitude of the 
response weightings would approach 
that of the cue weightings. 
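
A response weighting, as defined above, is a correlation between one cue's values and S's responses over a run of trials. The sketch below assumes the product-moment (Pearson) coefficient, which the text does not name explicitly; the function names and the dictionary layout are likewise illustrative.

```python
import math


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length, at least 2 long")
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)


def response_weightings(cues, responses):
    """Correlate each cue's trial-by-trial values with S's responses.

    cues maps a cue name (e.g., "color") to its values over trials;
    responses holds the marked line lengths on the same trials.
    """
    return {name: pearson_r(values, responses) for name, values in cues.items()}
```

Passing the correct line lengths in place of S's responses would, on the same assumption, return the cue weightings, so the two sets of coefficients can be compared cue by cue.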


METHOD 


Subjects—The Ss were 30 members of a 
ninth grade class, 20 boys and 10 girls. This 
was a special class composed of students of 
high academic performance. 

Materials.—The stimulus materials were
384 geometrical forms photographed on 35-mm.
Kodachrome slides and projected on a
screen from an automatic projector. Each
slide bore a number giving its position in the
standard order of presentation. The projected
form was approximately 2 ft. in diameter.
Each form consisted of an isosceles
triangle on a circular white background. A
portion of the triangle was black; the
remainder was of some hue. Three characteristics
of these stimuli varied from slide to
slide. These were color, angle, and area,
that is, the hue of the triangle, its orientation
with respect to the vertical, and the proportion
of the triangle that was colored rather
than black. Each of the three characteristics
varied over eight values; these were designated
1 through 8 (see Fig. 1).

A hue of red was assigned Value 1. The 
other hues and their values were red-orange 
(2), yellow-orange (3), yellow (4), yellow- 
green (5), green (6), blue (7), and violet (8). 
The two equal sides of the triangle were the 
long sides; the ratio of long to short was 5:2. 
If the point opposite the short side is regarded 
as an indicator, the orientation of the triangle 
can be described in terms of degrees of clockwise
rotation from the 0° (12 o'clock) position.
Values were assigned to these positions as
follows: 22.5° (1), 67.5° (2), 112.5° (3),
157.5° (4), 202.5° (5), 247.5° (6), 292.5° (7),
337.5° (8). The triangles varied along the
dimension of area from almost completely 
black (1), in equal increments of altitude, to 
almost completely colored (8). The black 
strip always paralleled the short side of the 
triangle. The values (1 through 8) of the 
three dimensions varied independently of
one another. A standard stimulus series was
presented to all Ss.


The response materials consisted of booklets of paper on which
were printed horizontal lines approximately 7 in. long. Each line
was an elongated arrow, pointing toward the right. The first 2
pages of a booklet contained 64 lines, numbered 1 to 64,
corresponding to numbers on the first 64 slides. The next 16
pages contained 256 bracketed pairs of lines, each pair given a
number corresponding to the number in the slide sequence. The
lower line of each pair was completely covered by a cardboard
strip, which was lightly pasted at both ends. Hidden by this
strip was a small red mark which designated correct length for
that line. The last 2 pages of the booklet were identical to the
first 2.

Procedure and design.—All Ss were tested in a single session of
approximately 2 hr. Two Es were present throughout the
experiment. The instructions were a fairly straightforward
description of the task and its significance. The following is an
excerpt from the instructions.
The experiment will investigate how we learn to make certain
kinds of judgments. You will see a series of slides projected on
a screen like this. Each of these forms contains the information
you need to tell you how to mark off a particular length along
an arrow on answer sheets like this. You will just make a small
dash, measuring from the left, for how long you think the line
should be. Sometimes it will go here, sometimes there. Each
slide contains all the information you need to mark off the
correct length. Your task in the experiment is to learn how to
use this information. At first you will get no help in estimating
how long the line should be. Later you will be able to find out
how long the line should be after making your own guess. The
answer sheets for most of the experiment will have two arrows
attached like this. The correct answers are covered by paper
strips. You can mark right on the line over the strip and then
pull down or tear back the strip to see the correct answer. Then
you can compare this answer with the mark you made and with
the picture. Be careful not to look at the correct mark until
after you make your own. Each design on the slide tells you how
long the line should be by the features that change from slide
to slide. Which way the figure is on the slide, which color, and
the amount of color are all clues to how long the line should be,
and just how, you will have to learn during the experiment.
Using all these features at the same time you may learn to get
the line length just right. This is difficult but the closer you
can come the better. The first two pages are a preliminary
measurement before you get help in estimating how long the line
should be. The last two pages, also, do not have correct answers
marked. You will see each slide only a short time so you will
have to judge pretty much by your first quick impression.

A series of 384 slides was presented; each slide was projected
for 10 sec., with a 2-sec. interval between slides. During each
exposure S viewed the slide, made a mark on the upper line of
the corresponding line-pair, tore back the cardboard strip, and
compared the correct line length marked on the lower line with
his mark and with the stimulus pattern. For the first 64 and the
last 64 trials, where no correct length was given, S simply
viewed the stimulus and marked the line. This provided a measure
of operant level and an extinction series. A 10-min. rest period
was given in the middle of the presentation.

The stimulus series of 384 figures was composed of six blocks of
64 each. The last block (E) was identical to the first (O).
Within each block of 64, each of the eight values of each cue
appeared equally often in combination with each value of the
other two cues. Each block was based on one of five different
latin squares, selected from a complete set of 8 × 8 latin
squares (Fisher & Yates, 1948, p. 63). The effect of this
arrangement was to provide a zero correlation between any two
cues within each block and to make almost all stimuli different,
aside from the extinction series. Within each block the order of
the various combinations of the three cues was randomized, and
the distribution of values for each cue was rectangular.

The relation of the cue values to the corresponding series of
correct line lengths was defined in the following way. Correct
line length was made a function of the three cue values, as
expressed by the equation $CLL = 2C_1 + 1.5C_2 + C_3$, where CLL
is correct line length, and the Cs are values of the three cues.
The coefficients of the Cs determine the cue weightings. Since
the minimum cue value was 1, and the maximum 8, correct line
length varied from 4.5, when all cue values were 1, to 36, when
all cues had the value 8. The relation between each cue and
correct line length can also be expressed in terms of a
correlation. Since the cues were uncorrelated, the correlations
of the three cues with correct line length were in the ratio
2:1.5:1. Since the square of a correlation corresponds to the
proportion of total variance predicted, it is possible to
calculate the value of the correlation coefficient for a cue by
the equation:

$$(2r)^2 + (1.5r)^2 + r^2 = 1.00, \qquad r = .371$$

Thus the three correlations, or cue weightings, were .371, .557,
and .743. The procedures provided S with all the information
necessary to make a correct response if he used each cue
correctly.

TABLE 1

EXPERIMENTAL CONDITIONS: WEIGHTINGS ASSIGNED EACH CUE IN
DETERMINING CORRECT LINE LENGTH

                 Cue Type and Weighting
Condition       .74        .56        .37
    1           Angle      Color      Area
    2           Area       Color      Angle
    3           Color      Angle      Area
    4           Area       Angle      Color
    5           Color      Area       Angle
    6           Angle      Area       Color

In order to ascertain the effect of various 
cue weightings independent of other proper- 
ties of the cue, six experimental conditions 
were provided. The design was completely 
counterbalanced with respect to cue type 
(color, angle, area) and cue weighting and 
their combinations (see Table 1). These six 
conditions were six different sets of correct 
line lengths for the single series of slides. The 
procedure for the various conditions differed 
only in the construction of the response 
booklet, that is, in the correct line lengths. 
All six experimental groups participated in 
the experiment together, being presented with 
the stimulus series simultaneously. The Ss 
were randomly assigned to experimental 
conditions. Data were obtained from 33 Ss; 
data from 1 randomly chosen S in each of 
three conditions were discarded to provide 
5 Ss in each condition. 
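
The design logic described in this section can be checked numerically. The sketch below is an illustration, not part of the original study: it builds one 64-trial block from a cyclic 8 × 8 latin square (an assumption; the study drew its squares from Fisher and Yates), applies the equation for correct line length, and computes the correlations among the cues and between each cue and correct line length. All function names are hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def latin_square_block(n=8):
    """One block of n*n trials as (row, column, symbol) of a cyclic latin square.

    The cyclic square is an illustrative assumption, not one of the
    squares actually used in the study.
    """
    return [(i + 1, j + 1, (i + j) % n + 1) for i in range(n) for j in range(n)]


def correct_line_length(c1, c2, c3):
    """CLL = 2*C1 + 1.5*C2 + C3, as given in the text."""
    return 2 * c1 + 1.5 * c2 + c3


block = latin_square_block()
c1, c2, c3 = (list(column) for column in zip(*block))
cll = [correct_line_length(*trial) for trial in block]

# Correlations between pairs of cues within the block.
cue_intercorrelations = [pearson(c1, c2), pearson(c1, c3), pearson(c2, c3)]
# Correlation of each cue with correct line length (the cue weightings).
cue_weightings = [pearson(c, cll) for c in (c1, c2, c3)]

print([round(r, 3) for r in cue_intercorrelations])
print([round(r, 3) for r in cue_weightings])
```

Because every pair of cue values occurs exactly once in a latin-square block, the cue intercorrelations vanish and the cue weightings depend only on the coefficients of the equation.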


RESULTS 
For each block of 64 presentations, 
four Pearson correlation coefficients 
were computed for each S. These 
were the correlations between his 
response and each cue, as well as 
between his response and correct 
line length. The correlations were
averaged over Ss by converting each 
correlation to Fisher’s z, averaging the 
z's, and converting the average to a 
Pearson r. Curves showing average 
response weightings for each cue 
weighting are presented in Fig. 2. 
The vertical position of the curves 
corresponds to the order of the cue 
weightings. 
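
The averaging procedure described above can be sketched as follows. This is a minimal illustration, assuming an unweighted mean of the z values; the function name and the example correlations are hypothetical and are not data from the study.

```python
from math import atanh, tanh


def average_correlations(rs):
    """Average Pearson r values by way of Fisher's z transformation."""
    zs = [atanh(r) for r in rs]      # convert each r to Fisher's z
    mean_z = sum(zs) / len(zs)       # average the z's
    return tanh(mean_z)              # convert the mean z back to r


# Hypothetical response-cue correlations for five Ss.
example_rs = [0.62, 0.48, 0.71, 0.55, 0.66]
print(round(average_correlations(example_rs), 3))
```

Averaging in z units keeps the skewed sampling distribution of large correlations from distorting the mean, which is the usual reason for preferring it to a direct average of the r values.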

An analysis of variance was per- 
formed on z transformations of the 
response weightings for the learning 
blocks, the four blocks in which cor- 
rect line length was presented to S. 
The main question to be answered 
by the analysis is whether Ss were 
learning to respond differentially to 
the different cue weightings; that is, 
whether there was a significant dif- 
ference in vertical placement of the 
curves. The analysis (Lindquist, 
1953, Type VI) is summarized in 
Table 2. The F for cue weighting is 
significant beyond the .001 level. 
This result confirms the prediction 
that the response weightings would 
differ with the cue weightings. 

In addition, there is an over-all trend toward increasing cue
utilization over these four blocks. The block effect is
significant beyond the .005 level. In order to ascertain whether
there is a trend toward increasing utilization of each separate
cue, the block effect was tested for each cue weighting
separately. The block effect was significant for the .74
weighting (P < .05, F = 2.79, df = 3/72) and for the .56
weighting (P < .025, F = 3.34, df = 3/72), but not for the .37
weighting (F < 1). The form of the learning curves does not
differ for different cue weightings; the W × B interaction is
not significant.

TABLE 2

ANALYSIS OF VARIANCE OF RESPONSE WEIGHTINGS USING A z
TRANSFORMATION

Source
Between Ss
    Conditions (C)
    Error (b)
Within Ss
    Cue weighting (W)
    Blocks (B)
    W × B
    W × C
    B × C
    W × B × C
    Error (w)
        Error1
        Error2
        Error3

* P < .005
** P < .001


The question of the final magnitudes of the response weightings
could not be answered since there was no indication that
learning had reached a limit. It is possible, however, to take
the observed correlation of response with correct line length on
a given block and to compute what the three response weightings
would be if they contributed to this total correlation in the
expected ratio 2:1.5:1. The response weightings computed on this
basis are presented in Table 3, along with the corresponding
observed weightings. Table 3 indicates that cue utilization,
throughout the learning trials, is roughly proportional to cue
validity, although the highest-weighted cue is being used
slightly more, and the lowest slightly less, than would be
predicted on the basis of proportionality.
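
One way to read the proportionality computation is sketched below. It rests on an assumption not spelled out in the text: with uncorrelated cues of equal variance, the correlation of response with correct line length equals the sum of each response weighting multiplied by the corresponding cue weighting, so response weightings in the ratio 2:1.5:1 are simply the total correlation multiplied by each cue weighting. The function names and the example total correlation are hypothetical.

```python
from math import sqrt

COEFFICIENTS = (2.0, 1.5, 1.0)  # coefficients of C1, C2, C3 in the CLL equation


def cue_weightings(coefficients=COEFFICIENTS):
    """Cue-criterion correlations for uncorrelated cues of equal variance."""
    norm = sqrt(sum(a * a for a in coefficients))
    return [a / norm for a in coefficients]


def proportional_response_weightings(r_total, coefficients=COEFFICIENTS):
    """Response weightings in the ratio of the cue weightings that together
    reproduce r_total, the correlation of response with correct line length."""
    return [r_total * w for w in cue_weightings(coefficients)]


def implied_total_correlation(response_weightings, coefficients=COEFFICIENTS):
    """Correlation with correct line length implied by the response weightings."""
    weights = cue_weightings(coefficients)
    return sum(w * r for w, r in zip(weights, response_weightings))


r_total = 0.60  # hypothetical observed correlation, not a value from the study
expected = proportional_response_weightings(r_total)
print([round(r, 3) for r in expected])
```

The last function runs the computation in reverse and serves as a consistency check: the proportional weightings, recombined, return the total correlation they were derived from.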


DISCUSSION 


The results indicate that Ss responded simultaneously and
differentially to the multiple cues. The differential response
is shown by the highly significant cue weighting effect.
Simultaneous response to the multiple cues, rather than use by
each S of only a single cue, is indicated by the fact that this
weighting transcends the S × Weighting interaction contained in
the error term. Also, once Ss are exposed to correct line
length, the correlation of response with correct line length is
higher than with the most heavily weighted cue, indicating
successful use of more than the most valid cue by itself.
Individual Ss must have responded to more than one cue during
any given block of trials. The curves suggest that there is some
response to the lowest weighted cue, although in the statistical
test this was not rising significantly. In general the Ss made
an appropriate use of the cues available.

TABLE 3

OBSERVED MEAN RESPONSE WEIGHTINGS (O) COMPARED WITH RESPONSE
WEIGHTINGS COMPUTED ON THE BASIS OF PROPORTIONALITY

The importance to adaptive behavior 
of appropriate response to multiple cues 
has been emphasized for various situa- 
tions: in terms of the learning of func- 
tional relations (Miller, 1959), for pre- 
diction of judgmental behavior (Johnson, 
1955), with application to clinical ap- 
praisal (Hammond, 1955; Hoffman, 
1960). Important differences of em- 
phasis are implied in the choice of any 
one methodological framework for the 
study of cue utilization. In particular 
one may contrast the orientation implied 
by a correlational framework with that 
implied by an event frequency frame- 
work, such as is used in studies of prob- 
ability matching. Both frameworks may 
be used to study adaptation to environmental
uncertainties. The event frequency
situation provides for the success
of a given response to be probabilistic. 
Usually the best adaptation possible is 
consistent choice of the most frequent 
alternative, with probability matching 
implying less than optimal adjustment. 
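
The claim that probability matching is less than optimal in an event frequency situation can be illustrated with a short calculation. The sketch assumes a two-alternative task in which one event occurs with probability p on each trial, independently of S's choice; the function names and the value p = .70 are illustrative only.

```python
def accuracy_maximizing(p):
    """Expected proportion correct when S always predicts the more frequent event."""
    return max(p, 1 - p)


def accuracy_matching(p):
    """Expected proportion correct when S predicts each event with its own frequency."""
    return p * p + (1 - p) * (1 - p)


p = 0.70  # hypothetical frequency of the more common event
print(round(accuracy_maximizing(p), 2), round(accuracy_matching(p), 2))
```

Matching equals maximizing only when p is .5, 0, or 1; for every other value it yields fewer correct predictions.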


In a correlational framework, the prob- 
abilistic character of the individual cues 
does not necessarily adhere to the cues 
in combination, which may be unequivo- 
cally related to the criterion as they 
were in the present study. In terms of 
behavior, the matching of response 
weighting to cue weighting would imply 
fully optimal adjustment to the situa- 
tion; there may be consistently accurate 
responses on the basis of individually 
limited cues. A correlational frame- 
work, then, emphasizes cue utilization 
as a process which tends to remove un- 
certainty. Much of our judgmental and 
discriminative behavior is characterized 
by a sufficient degree of sureness and 
adaptiveness to be best studied within 
such a framework. 


SUMMARY 
This experiment investigated the relation
between the objective validity of certain cues
and the extent to which these cues were used.
The purpose was to analyze the learning of
responses to multiple cues of different validi- 
ties, in order to determine how much the 
responses came to depend on each cue. The 
independent variable was the correlation 
imposed between a criterion and each of three 
simultaneously presented visual cues. The 
dependent variable was the correlation 
between the cues and Ss’ responses. 

The Ss were 30 members of a ninth grade 
class. The stimulus materials were geometric 
forms. Three characteristics of these stimuli 
varied, acting as cues for the prediction of a 
line length. These characteristics were based 
on the color, angle, and area of parts of the 
forms, and were counterbalanced to establish 
independent cue validities. The cue validities 
of .74, .56, and .37 together permitted a per- 
fect prediction of line length. During four 
blocks of 64 trials each, Ss responded and 
then were presented with the correct line 
length. 

Successful cue utilization increased during 
the learning session. The Ss responded to 
different cues simultaneously, and the extent 
to which cues were used differed with validity. 
Cue utilization was roughly proportional to 
cue validity throughout the learning trials. 
STIMULUS-RESPONSE CONTIGUITY IN CLASSICAL
AVERSIVE CONDITIONING ¹


R. A. CHAMPION

University of Sydney


Stimulus-response contiguity is widely accepted as a necessary,
if not sufficient, condition of learning. In particular, it has
been suggested that even though reinforcement may be required
for complete learning, a response which is being learned cannot
be reinforced until it has begun to occur and that S-R
contiguity may operate as a crucial factor in initiating any
learning process (Mason, 1959). In classical conditioning,
degree of S-R contiguity is represented in the temporal interval
between the CS and the UCR, and the following experiment was
designed to test the hypothesis that, early in conditioning,
performance is proportional to degree of CS-UCR contiguity.

In order to make this test a short-latency response (eyeblink)
and a long-latency response (GSR) were conditioned
simultaneously in each S and the time relations between CS and
UCR were varied so as to permit greater contiguity of eyeblink
and CS in one group of Ss and greater contiguity of GSR and CS
in a second group. This type of arrangement is represented in
Fig. 1; with an air puff as the UCS, the CS (tone, T) may be
presented near the unconditioned eyeblink as in the case of T1,
or near the unconditioned GSR as with T2. In terms of the
contiguity hypothesis it was predicted that the eyeblink would
condition better than the GSR with T1 whereas the reverse would
hold with T2, i.e., that there would be interaction between the
CS-UCS interval and the latency of the UCR as independent
variables affecting conditioning. By the same token it was
expected that the early conditioning of the eyeblink would be
better with T1 than with T2 but that T2 would produce the
superior performance in the case of the GSR.

The experiment provided an opportunity to confirm an earlier
finding that backward conditioning produced some true learning
of the long-latency GSR (Champion & Jones, 1961). It is apparent
from Fig. 1 that UCS-CS trials must be used in the conditioning
of the GSR if optimum CS-UCR contiguity is to be achieved and
this arrangement was therefore employed in the second of the two
groups described above. After these two groups had been trained,
however, the question arose as to the real meaning of "backward
conditioning." The accepted definition seems to be put simply in
terms of the presentation of the UCS before the CS, but this
does not guarantee that the UCR will precede the CS. In fact, in
the second arrangement described above, favoring contiguity of
GSR and tone (T2 in Fig. 1), the CS still precedes the UCR. A
third group was therefore trained with an even longer backward
UCS-CS interval so as to ensure that the unconditioned GSR
preceded the CS (T3 in Fig. 1). An attempt was made to set this
interval so that degree of contiguity was nevertheless
approximately the same as in the second group, i.e., so that
CS-UCR contiguity matched UCR-CS contiguity. The use of this
third group allowed a test of the possibility that R-S and S-R
contiguity are equally effective.

FIG. 1. Schematic representation of time relations between
stimuli and responses in the three groups of the experiment.
(T1, T2, and T3 represent the location of the CS in Groups 1, 2,
and 3, respectively.)

1 Part of the apparatus used in this study
was provided by Commonwealth of Australia
Research Grant No. 2002.

METHOD 


Subjects —The Ss were 49 volunteers from 
courses in psychology at the University of 
Sydney. 

Apparatus.—The general experimental situation and the apparatus
for measuring the GSR have been described elsewhere (Champion &
Jones, 1961). The eyeblink was measured with the system
developed at the State University of Iowa laboratory, i.e., a
microtorque potentiometer linked mechanically to S's right
eyelid and electrically to an ink-writing recorder through a dc
amplifier. The CS was a 2000-cps, 90-db. tone of 50-msec.
duration delivered to S through headphones. The UCS was a
3.0-psi air puff applied to S's right eye through a .062-in.
diameter tube and set at 50 msec. duration by means of an ac
solenoid valve. All time intervals except the intertrial
interval were controlled with electronic timers.

Procedure.—In the course of general instructions Ss were asked
to keep their eyes open except for normal blinking. The series
of trials for each S began with one UCS-alone trial followed by
one CS-alone trial. The remaining presentations of stimuli
consisted of training trials (CS-UCS or UCS-CS) interspersed
with test trials (CS alone) in the following order: 3 training,
one test, 7 training, one test, 10 training, and one test trial.
Thus test trials to assess the progress of conditioning were
administered after 0, 3, 10, and 20 training trials. The
training series was limited to 20 trials because the hypothesis
under test dealt only with the early stages of conditioning. The
intertrial interval averaged 30 sec. and varied between 25, 30,
and 35 sec. in prearranged random order. No ready signal was
given at any stage.
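
The order of trials described above can be laid out programmatically as a check on the stated test points. This is only a sketch of the schedule as described; the function names and trial labels are hypothetical, and the initial CS-alone trial is treated as the first test trial.

```python
def build_trial_sequence(training_runs=(3, 7, 10)):
    """Return the ordered trial labels for one S."""
    # The series opens with one UCS-alone trial, then one CS-alone trial,
    # which serves as the test trial given before any training.
    sequence = ["UCS-alone", "test"]
    for run_length in training_runs:
        sequence.extend(["training"] * run_length)
        sequence.append("test")
    return sequence


def training_trials_before_each_test(sequence):
    """Count the training trials completed before each test trial."""
    counts, completed = [], 0
    for trial in sequence:
        if trial == "training":
            completed += 1
        elif trial == "test":
            counts.append(completed)
    return counts


sequence = build_trial_sequence()
print(training_trials_before_each_test(sequence))
```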

As represented in Fig. 1, Groups 1 and 2 were conditioned with a
CS-UCS interval of 400 msec. and a UCS-CS interval of 1200
msec., respectively. Group 3 was then trained with a UCS-CS
interval of 2800 msec. A conditioned eyeblink was defined as
occurring on test trials with a pen deflection of 1 mm. or more
within the interval 200-500 msec. following the onset of the CS.
No attempt was made to exclude "voluntary responders." A GSR on
test trials was taken to be any response occurring in the
interval 1-4 sec. after the CS. There was some adaptation of the
GSR to the air puff and the results of 4 Ss who failed to give a
response on 6 or more of the 20 training trials were discarded.
This left 15 Ss in each of the three groups. There were 7, 8,
and 6 women in Groups 1, 2, and 3, respectively; a survey of the
results showed that the men gave insignificantly more CRs of
each type compared with the women.

RESULTS 


In view of the predicted interaction between the length of the
CS-UCS interval and the latency of the UCR it was necessary to
make a direct comparison between the performance of the eyeblink
and the GSR. For this purpose a technique was used which had
already proved effective (Mason, 1959) and which allowed a
measure of the frequency of the conditioned GSR. For each S the
GSR scores (change in conductance in micromhos) obtained on test
trials during training were expressed as a ratio of the response
on the first test trial, before training; this transformation
overcame individual differences in general sensitivity of
response. In the few cases where there was no response on the
first test trial the least measurable response of 0.1 micromhos
was assumed. The transformed scores, ranging from 0 to 35.0,
were pooled and found to have a median of 2.6. Each value at or
above 2.6 in the transformed scores was then taken as a
conditioned GSR.
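
The scoring rule just described can be sketched as below. This is a minimal illustration of the ratio transformation and the pooled-median criterion, assuming each S contributes one pre-training score and several later test-trial scores; the function names and the example values are hypothetical and are not data from the experiment.

```python
from statistics import median

LEAST_MEASURABLE = 0.1  # micromhos; assumed when the first test trial shows no response


def transform_scores(first_test, later_tests):
    """Express each later test-trial GSR as a ratio of the pre-training response."""
    baseline = first_test if first_test > 0 else LEAST_MEASURABLE
    return [score / baseline for score in later_tests]


def classify_conditioned(subjects):
    """Pool the transformed scores, take their median as the criterion, and
    flag every transformed score at or above it as a conditioned GSR.

    `subjects` is a list of (first_test, later_tests) pairs.
    """
    transformed = [transform_scores(first, later) for first, later in subjects]
    pooled = [value for scores in transformed for value in scores]
    criterion = median(pooled)
    flags = [[value >= criterion for value in scores] for scores in transformed]
    return criterion, flags


# Hypothetical conductance changes for three Ss.
example = [(1.0, [1.0, 2.0, 6.0]), (0.0, [0.2, 0.5, 0.3]), (2.0, [1.0, 3.0, 8.0])]
criterion, flags = classify_conditioned(example)
print(criterion, flags)
```

Scoring each response against S's own pre-training level is what allows a frequency measure for the GSR that can be set beside the frequency of conditioned eyeblinks.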
The eyeblink and the GSR were conditioned simultaneously in each
S and it was possible to test for the predicted interaction by
comparing across groups the proportion of Ss giving more
conditioned eyeblinks than conditioned GSRs on the test trials
during training. These data are set out in Table 1; the result
for Groups 1 and 2 was in accord with the prediction and a
Fisher exact probability test (Siegel, 1956), applied with cases
of equal frequency of response omitted, showed the interaction
to be significant (P < .01 for a two-tailed test). Comparison of
these results with the data from Group 3 showed the interaction
to be significant for Groups 1 and 3 (P < .05) but not for
Groups 2 and 3.

TABLE 1

NUMBER OF Ss GIVING MORE, EQUAL, AND FEWER CONDITIONED EYEBLINKS
COMPARED WITH CONDITIONED GSRs ON TEST TRIALS DURING TRAINING

Eyeblinks > GSRs
Eyeblinks = GSRs
Eyeblinks < GSRs


The other aspect of the predicted interaction involved
within-response comparisons over the various CS-UCS intervals.
Performance curves for the eyeblink CR are shown in Fig. 2. The
performance of Group 1 was consistently superior to that of
Group 2, as predicted, but the application of Fisher tests
showed that a statistically significant difference did not
emerge until 20 training trials (P < .01); at 10 trials the
difference approached significance (.10 > P > .05). The same
relationship held between Groups 1 and 3 but there was no
significant difference at any stage between Groups 2 and 3. An
overall test at 20 trials proved significant (χ² = 12.09,
P < .01, df = 2). These results conform to the general finding
that a 400-msec. interval is almost optimum for the eyeblink and
that backward conditioning of this response is ineffective.

FIG. 2. Performance curves for eyeblink conditioning.

FIG. 3. Performance curves for GSR conditioning.

The more important within-response comparison concerned the GSR,
and performance curves for the conditioning of this response are
shown in Fig. 3. As predicted, Group 2 was consistently superior
to Group 1, but according to Fisher tests the difference was
only statistically significant at 3 trials (P < .05). Groups 1
and 3 did not differ significantly at any stage. An overall test
at 3 trials gave a significant result (χ² = 6.66, P < .05,
df = 2).
More sensitive measures than fre- 
quency of response were available 
for the GSR and the transformed 
amplitude scores were studied for 
further information. Nonparametric 
statistical tests were used because of 
the marked positive skew in the 
distribution of these scores. The 
results given above were confirmed; 
in addition, the application of a U 
test at 20 trials showed that the 
difference between Groups 1 and 2 
approached significance (U = 68,
.10 > P > .05). At this stage there
was a significant difference in amplitude
of response between Groups 2
and 3, with the former superior
(U = 51, P < .05), but not between
Groups 1 and 3. An overall test
applied to the three groups at 20
trials proved significant (H = 6.99,
P < .05, df = 2).

As a check on the possible introduction of some artifact in the
transformation of the GSR scores, Mood's test of trend
(McNemar, 1955) was applied to the raw change-in-conductance
scores for each group. The observed trends in these data were
virtually identical with those in the frequency data, depicted
in Fig. 3; they were significant for Groups 2 (χ² = 11.8,
P < .01, df = 3) and 3 (χ² = 11.4, P < .01), but not for
Group 1 (χ² = 2.20, P > .50).


DISCUSSION 


The predictions about the effects of 
S-R contiguity in classical conditioning 
were confirmed by the data. It may be 
asked why exact contiguity of CS and 
UCR was not chosen as the condition 
most likely to yield optimum perform- 
ance early in conditioning; with the 
assumption of latencies of 200 msec. 
and 2000 msec. for the unconditioned 
eyeblink and GSR, respectively, this
would have been achieved with the use
of corresponding backward UCS-CS 
intervals. A pilot study conducted 
along these lines before the present 
experiment produced results exactly in 
keeping with the contiguity hypothesis, 
but the level of conditioning in both 
groups was so low that the interaction 
was not statistically significant. Poor 
conditioning with a short backward 
interval is an established result for a 
short-latency response (e.g., Wolfle, 
1932) and the results of Groups 2 and 3 
in the present experiment suggest that 
there is a decrease in the effectiveness of 
GSR conditioning in early stages with 
a backward interval somewhere between 
1200 msec. and 2800 msec., possibly of 
the order of 2000 msec. An obvious 
inference to be drawn from these con- 
siderations is that the required con- 
tiguity lies not between CS and UCR as 
overt events but rather between cor- 
responding physiological processes, lag- 
ging behind the CS in stimulus reception 
and preceding the UCR in response 
evocation. 

It is tempting to suppose that the CS 
must precede the UCR for contiguity 
to be effective. This would explain why 
the conditioning of the eyeblink did not 
occur in Groups 2 and 3, and why Group 
3 was inferior to Group 2 and comparable 
with Group 1 on some counts in the 
conditioning of the GSR. The supposi- 
tion is contradicted, however, by the 
comparable performance of Groups 2 
and 3 with respect to the relative fre- 
quency of the two types of CR (Table 1) 
and by the presence of a significant trend 
in the GSR conditioning of Group 3. 
Both these latter results are probably due 
to the early rise in the performance of 
Group 3, at three training trials. Con- 
tiguity appears to have had some effect 
when the UCR preceded the CS (Group 
3), but the performance was inferior to 
and not as sustained as that obtained 
with a comparable degree of CS-UCR 
contiguity (Group 2). Such an outcome 
could be due to two factors which are not 
mutually exclusive. First, if allowance 
is made for the physiological processes 
delineated above, then the contiguity
between stimulus reception and response 
initiation would be greater in Group 2 
than in Group 3 (Fig. 1); under the 
conditions of Group 3, of course, some 
physiological mechanism must be found 
which permits contiguity to act when 
response initiation precedes stimulus 
reception. Second, if delay of reinforce- 
ment is represented in the interval be- 
tween CR and UCS, then it is greater in 
Group 3 than in Group 2 and perform- 
ance in the former group should not have 
been so sustained as in the latter group 
for that reason; this consideration brings 
with it the problem of “backward” 
reinforcement when the UCS precedes 
the CR. More generally, it may seem improper to invoke the
effects of reinforcement if attention is limited to the initial
stages of learning, where contiguity has been taken as the prime
factor operating. Reinforcement may take effect, however, as
soon as the learned response appears, no matter how weakly, and
the precise separation of this effect from that of contiguity
awaits the use of some more refined technique than mere
reference to early and late stages of learning.

Attention should be drawn to two other features of the data.
There is some conflict with the results of the previous study
(Champion & Jones, 1961), where the backward conditioning of the
GSR (with a UCS-CS interval of 500 msec.) was inferior from the
outset to forward conditioning (with a CS-UCS interval of 500
msec.) whereas the performance of Group 2 was superior to that
of Group 1 in the present study. The 
discrepancy might be accounted for in 
terms of differential degrees of contiguity 
and reinforcement, but the differences 
in procedure in the two experiments are 
too great to permit this explanation to 
be pursued with any confidence. A 
second point of interest was the appear- 
ance in the present data of a statistically 
significant contiguity effect early in 
training with the GSR but relatively 
later with the eyeblink. This result 
would emerge if greater contiguity was
achieved by chance in Group 2 with the
GSR than in Group 1 with the eyeblink. 
It should be noted that the only set of 
conditions favoring performance on the 
grounds of both contiguity and rein- 
forcement was that prevailing in Group 
1 for the eyeblink and that only under 
these conditions did the performance 
curve rise throughout its course. 


SUMMARY 


A test was made of the hypothesis that
CS-UCR contiguity is an important factor
in the initial stages of classical aversive
conditioning. The short-latency eyeblink
and long-latency GSR were conditioned
simultaneously in each S, with tone as CS
and air puff as UCS. A forward CS-UCS 
interval of 400 msec. and a backward UCS-CS 
interval of 1200 msec. were used in separate 
groups to favor contiguity of CS with eye- 
blink and GSR, respectively. The acquisition 
of the eyeblink was superior with the forward 
interval and inferior with the backward 
interval when compared with that of the GSR. 
Within-response comparisons showed that the 
eyeblink conditioned better with the forward 
interval whereas the GSR conditioned better 
with the backward interval. To test the effect 
of UCR-CS contiguity in the case of the GSR 
a third group was trained with a backward 
UCS-CS interval of 2800 msec.; this condi- 
tion produced an initial rise in performance 
which was not sustained. The results of the 
experiment were interpreted as supporting 
the hypothesis. 
ASSOCIATIONS, SETS, AND THE SOLUTION OF
WORD PROBLEMS ¹


MIRIAM

Johns Hopkins University


1 This study is based upon a dissertation
submitted to the Johns Hopkins University
in partial fulfillment of the requirements for
the PhD degree, and was supported in part
by Public Health Service Fellowship Number
MF 13,450 from the National Institute of
Mental Health. The author expresses her
thanks to James E. ...
throughout the study.

2 Personnel Research Psychologist, Bureau
of Old-Age and Survivors Insurance, Social
Security Administration, Department of
Health, Education, and Welfare.

3 The probability of occurrence of response
can be considered the definition of set. The
probability of occurrence of a response is then
inferred from the time it takes the response
to occur or the frequency with which it occurs.


Investigators have used word problems (anagrams and skeleton
words) to explore the "category set," which can be defined as a
readiness to respond to words belonging to a common category or
class, and can be measured by the speed or frequency ³ with
which certain responses occur. For example, Starch (1911) and
others demonstrated the existence of the category set when Ss
solved problems made from lists of words belonging to a common
class more rapidly than problems made from random word lists.
Later, Rees and Israel (1935) and others measured the strength
of the category set by the frequency with which Ss solved
multiple solution problems as category words after practice on
unique solution problems which could be solved only as words of
the selected category. Past Es reported that telling Ss the
category name did not always have a consistent effect; what they
are surer of is that the set arises from verbal context during
problem solving. Unfortunately, the past Es selected the
category words and category name on an a priori basis, and none
of them systematically investigated the verbal context
triggering the set. The
present study aims to show that asso- 
ciative strength between words from 
which problems are made is a per- 
tinent variable to manipulate in the 
operation of the 
anagram solving. 


category set in 


This approach stems from work in free 
recall showing that associative strength 
between words presented to Ss signifi- 
cantly determines Ss’ verbal productions. 
Additional support for this approach is 
given by Mayzner and Tresselt (1958) 
who suggest that processes in anagram 
solving may be described by laws similar 
to those in verbal recall. 

Especially relevant to the problem investigated here is a study by Deese
(1959) who investigated the effect on recall of presenting Ss with lists
of words all of which are associated, with the average strength of
association among the group expressed in an index he called interitem
associative strength. He found that in “organized” lists of words high
on this index, Ss recalled more and had fewer verbal intrusions.
Postulating that free recall is in part free associating, Deese
explained that Ss recalled more with the organized lists because there
was a greater probability of Ss’ associations actually being listed
items.

The present study extends Deese’s 
model of free recall to anagram solving 
and hypothesizes that the category 
set in anagram solving operates by 
associations between words selected 
by Es as belonging to a given category. On the basis of this hypothesis,
the following predictions were made:
(a) Anagrams made from organized lists of associatively related words
will be solved more quickly than anagrams made from random word lists,
because in the former, associations called up by solved problems should
aid Ss in the solution of subsequent ones. (b) There should be a rapid
decrease in solution time with trials when Ss solve anagrams made from
associatively related words as associations called up by solved
problems begin to aid Ss in the solution of subsequent ones.

METHOD

The study consisted of two parts: selection of stimulus materials and
the experiment itself.

The experiment.—The experiment consisted of three major conditions.
In Cond. O and Cond. O-L, groups of Ss were presented anagrams made
from “organized” lists of words (words that elicit each other in free
association). In addition, Ss assigned to Cond. O-L were told labels
describing the lists of words from which the anagrams were made. These
labels were selected on the basis of normative data gathered prior to
the experiment. The labels were associates to the words of the list and
in turn elicited them. In Cond. R, Ss were given anagrams made from
“random” lists of words (words with essentially zero probability of
eliciting each other in free association). The experimental design was
such that the same sample of words in the same anagram form was used
for all experimental conditions. Thus, as many factors as possible were
constant for all conditions and whatever effects might occur could be
attributed to the influence of the associative context.

Stimulus materials.—The stimulus materials consisted of 36 anagrams.
The 36 words from which the anagrams were made came from 6 organized
lists of 6 words each. These lists were constructed after administering
several word association tests consisting of 200 items including filler
items to groups of from 50 to 100 Johns Hopkins University
undergraduates. The instructions given to Ss in this part of the study
were standard word association test instructions. The following
governed the selection of the six words of each organized list: the six
words should have ... associative strength, ... have a maximum
probability of eliciting each other in free association, and be words
from which only unique solution anagrams could be made.4 The six lists
of six words each are shown in Table 1.

TABLE 1

SIX ORGANIZED LISTS USED IN THE EXPERIMENT

4 Two of the words do not form unique solution anagrams. These are SOFA
which could be structured OAFS and CIRCLE which could also be CLERIC.
In the anagram form in which they were presented, none of these
solutions occurred.

Labels describing the organized lists in Cond. O-L were obtained as
follows: A


MIRIAM A. SAFREN 


TABLE 2

OVER-ALL MEDIAN SOLUTION TIMES (IN SEC.) FOR SIX ORGANIZED LISTS OF
DIFFERING INTERITEM ASSOCIATIVE STRENGTHS WITH AND
WITHOUT LABELS

Interitem Associative Strength (Mean | Variance) | No. Words Labels Elicit | Median Time (No Labels) | Median Time (with Labels)

245.07
146.86
79.40
95.46
137.72
183.40

1.32
.96
.90
1.06
1.69
1.18

Note.—Median time scores for each list are based upon data of 6 Ss who solved six anagrams each.


group of 60 Ss, Johns Hopkins University 
undergraduates, were given a 6-page booklet 
containing the six organized lists, one per 
page. The Ss were told to look at each list 
of words and to think of a label or labels that 
described them or the way in which they were 
similar, and to note this in the space pro- 
vided. The various labels and the frequency 
with which they were given were tabulated. 
The labels selected for each list were those 
which included 50% or more of the responses. 
The labels are shown in Table 1. 

As the labels of each list were associates 
to the words of that list, further normative 
data were obtained to find out if the labels 
would elicit the words of the list. A sample 
of 40 Ss, Johns Hopkins University under- 
graduates, were given a 6-page booklet con- 
taining the six sets of labels, one per page. 
The Ss were told to look at the labels on each 
page and then to write down the words which 
the labels called to mind, but to spend no 
longer than 1 min. associating to each set of 
labels. Table 2 shows the average number of 
words of each organized list which the labels 
of that list elicited. The mean number of 
words called up by these labels in 1 min. was 
10 (SD = 6.32). 


TABLE 3
ANAGRAM SOLUTION TIMES (IN SEC.) FOR
THE THREE CONDITIONS

Condition | N | Median | Q | Range
R | 36 | | | 2-67.8
O | 36 | | | .1-49.2
O-L | 36 | | | .6-22.4

Note.—Time scores on which medians, Qs, and ranges
are calculated are the median solution times for each
of the 36 Ss tested under each condition.


For Cond. R in which Ss were presented 
with anagrams made from random lists of 
words, 36 random lists of six words each were 
generated as follows: One word at a time was 
taken from each of the 6 organized lists to 
form a random list of six words. By a pre- 
arranged order, every word of each organized 
list occurred in 6 of the 36 random lists but 
never in the context of any other word of the 
same organized list. 
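
The constraint described above can be illustrated with a short sketch. The actual prearranged order used in the study is not given here, so the cyclic rule below is only one assumed way of meeting the stated conditions, and the word labels are hypothetical placeholders rather than the words of Table 1.

```python
from collections import Counter

def build_random_lists(organized):
    """Return k*k lists, each holding exactly one word from every organized list.

    The index rule (a + i*b) mod k is an assumption made for illustration;
    the source only says that a prearranged order was used.
    """
    k = len(organized)
    random_lists = []
    for a in range(k):
        for b in range(k):
            # Organized list i contributes its word at position (a + i*b) mod k.
            random_lists.append([organized[i][(a + i * b) % k] for i in range(k)])
    return random_lists

# Hypothetical placeholder words: L3W2 stands for word 2 of organized list 3.
organized = [[f"L{i + 1}W{j + 1}" for j in range(6)] for i in range(6)]
random_lists = build_random_lists(organized)

# How often each word is used across the 36 random lists.
counts = Counter(word for lst in random_lists for word in lst)
print(len(random_lists), set(counts.values()))
```

Because every random list draws one word from each organized list, no two words of the same organized list can ever appear together, and each word is used in 6 of the 36 lists.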

In constructing the anagrams, an attempt 
was made to control for three factors that 
affect difficulty of solution (Mayzner & 
Tresselt, 1958, 1959; Sargent, 1940): (a) the 
degree of difference of the letter arrangement 
from the original order in the word, (b) the 
transitional probabilities between the letters 
in the disarranged form, and (c) frequency of 
occurrence of the word in printed matter 
(Thorndike & Lorge, 1944). 

Subjects.—A total of 108 Ss from the
introductory psychology class at the Johns 
Hopkins University were randomly assigned, 
36 per condition. For Cond. O and O-L there 
were 6 lists of six anagrams each. Different 
groups of 6 Ss received each of the 6 lists, 
and within each group the order of occurrence 
of the six anagrams was counterbalanced over 
Ss. For Cond. R, there were 36 lists of six 
anagrams each. Each of the 36 Ss assigned 
to this condition received a different list. 

Procedure.—All Ss were presented with 
six anagrams typed in capitals on 3 X 5 in. 
cards and told the following: (a) they would 
have six anagrams to solve; (b) an anagram 
is a word with the letters disarranged; (c) 
they were to look at the anagram, try to 
determine the original word without the use
of paper and pencil, and then to give the
solution orally; (d) they would have 4 min.
to solve each problem and then would be
given the solution; (e) they were to work as
rapidly as possible as they were being timed
on each problem. In the case of Cond. O-L,
Ss were told that the anagrams were made
from a list of words related by association,
and given the labels that described the list.
Solution times were recorded with a stop-
watch, and all Ss were tested individually.

5 The 36 random lists are in the doctoral
dissertation on file at the library of the Johns
Hopkins University.


RESULTS 


Because of the nature of the distribution of Ss’ anagram solution
times, medians rather than means were chosen to represent S’s typical
performance in the statistical analysis. Table 3 summarizes the data on
solution time for the three conditions showing that medians, Qs, and
ranges decrease in going from Cond. R to O to O-L. The overall
statistical significance of the difference between the three
experimental conditions, using the Kruskal-Wallis one-way analysis of
variance, is P < .005. A comparison of Cond. O with Cond. R by the
Mann-Whitney U test yielded P < .02. A similar comparison between
Cond. O and O-L yielded P < .003.

With a significant difference between Cond. R and Cond. O, a detailed
analysis was made of the organized lists of differing interitem
associative strengths. A Kruskal-Wallis one-way analysis of variance by
ranks showed a significant difference between organized lists
(P < .05). However, the correlation (Kendall’s tau) between interitem
associative strength index and median solution time for the organized
lists was -.28 and nonsignificant. Table 2 shows this information.

Since the comparison between Cond. O and Cond. O-L was very significant,
Mann-Whitney U tests were done comparing median solution times for
the 6 Ss of each organized list with

the comparable 6 Ss of the same 
organized list with labels. For Lists 
3, 5, and 6 there were significant 
differences between the two condi- 
tions (P = .021, .013, and .002,
respectively). For Lists 1, 2, and 4 
the differences were not significant 
(P = .155, .242, and respec- 
tively). Therefore, it may be con- 
cluded that anagram solution is 
facilitated when Ss are given ana- 
grams made from “organized’’ asso- 
ciatively related word lists. Anagram 
solution is facilitated still further 
when in addition, Ss are told labels 
describing the organized lists when 
these labels are (a) associates to the 
words and (6) in turn call up the 
words of the lists. 

The overall Trial effect (Ss under all conditions had six problems to
solve), as tested by the Friedman two-way analysis of variance by ranks,
was significant (P < .02). However, there appears to be an interaction
between trials and conditions (see Fig. 1), for the Trial effect is
nonsignificant for Cond. R (.50 < P < .70) and nonsignificant for
Cond. O-L (.10 < P < .20). The Trial effect is significant for Cond. O
(P < .05). The significant Trial effect for Cond. O verifies the
predicted decrease in solution time with trials when Ss solve anagrams
made from organized lists as associations called up by the context
begin to aid Ss in the solution of subsequent problems.


With a significant Trial effect for 
Cond. O, analysis was made of the 
Trial effect for each organized list. 
Only the Trial effect for List 3 was 
significant (P < .02). However, the 
sample size for each organized list 
was only 6. 


Although the six organized lists vary little in the mean probability
with which the words elicit each other in free association, there is a
significant difference in the variance about the mean (see Table 2).
Bartlett’s test for the difference between the K variance estimates was
significant at P < .05. Furthermore, there seems to be a relationship
between the interitem associative strength index and the decrease in
solution time with trials for each organized list.6 Ranking each list
by the magnitude of its variance and then ranking each according to the
amount of change in solution time with trials results in a correlation
(Kendall’s tau) of -.60, P = .068. This suggests that the greater the
variance about the interitem associative strength index the less the
change in solution time with trials.

FIG. 1. Median anagram solution time per trial for each of the three
experimental conditions. (Each point is based upon the data of 36 Ss.)
(Ordinate: median time in seconds; abscissa: trials.)

A similar correlation (Kendall’s tau) between the interitem associative
strength of each list and the variance about this value is .60
(P = .068). Thus, it appears that organized lists with higher interitem
associative strengths had a higher variance about this index which
would tend to make the drop in solution time with trials less.

Possibly, not controlling for the variance about the interitem
associative strength index was one reason why it was not possible to
show a positive relationship between the magnitude of the index and the
median solution time for each organized list. It is also possible that
other factors inhibiting this relationship were (a) the narrow range of
interitem associative strengths used, and (b) the inability to make
anagrams of all organized lists of equal difficulty.

6 A table showing the trial effect for each organized list is in the
doctoral dissertation.
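
For readers who want to check the Kendall's tau probabilities reported above for six lists, the following is a minimal sketch of an exact permutation calculation. It assumes untied ranks and a one-tailed probability; whether the author obtained P = .068 from an exact table in this way is an assumption, and the function names are illustrative only.

```python
from itertools import permutations

def kendall_s(ranks):
    """Kendall's S: concordant minus discordant pairs against the order 1..n."""
    s = 0
    n = len(ranks)
    for i in range(n):
        for j in range(i + 1, n):
            s += 1 if ranks[j] > ranks[i] else -1
    return s

def exact_one_tailed_p(n, s_observed):
    """P(S >= |s_observed|) when all n! rankings are equally likely (no ties)."""
    total = 0
    hits = 0
    for perm in permutations(range(1, n + 1)):
        total += 1
        if kendall_s(perm) >= abs(s_observed):
            hits += 1
    return hits / total

n = 6                              # six organized lists
pairs = n * (n - 1) // 2           # 15 pairs of lists
s_obs = round(0.60 * pairs)        # |tau| = .60 corresponds to |S| = 9
p_one_tailed = exact_one_tailed_p(n, s_obs)
print(s_obs, round(p_one_tailed, 3))   # prints: 9 0.068
```

Under these assumptions, 49 of the 720 possible rankings give S of 9 or more, which agrees with the reported P = .068 for a tau of .60 in either direction.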


DISCUSSION

The results support the hypothesis that in the solving of word problems
such as anagrams, the category set operates by associations between
words selected by E as belonging to a common category. The results also
suggest processes in the solving of word problems such as anagrams may
be similar to those in verbal recall; for in both verbal recall and
anagram solving, associative strength between words significantly
determines Ss’ verbal productions.

A suggested model to explain the results is as follows: When Ss solve a
list of anagrams, at least in part, they are calling up words to match
the letters presented and sampling from a momentary response pool of
available words. After Ss have determined some of the words from which
the problems were made, these words implicitly evoke others which are
associatively related. These associations become part of the momentary
pool of available words from which Ss are sampling. Thus, implicitly
evoked associations from the solution of preceding problems can
facilitate the solution of subsequent problems by converging upon words
on the list from which the anagrams were made.

When Ss are given anagrams made from associatively related words and
also given labels for these words, these labels evoke at the outset a
response pool of available words some of which converge upon solutions
to the anagrams. Since both associations elicited by solved anagrams
and those called up by labels increase the availability of anagram
solution, anagrams are solved most quickly under Cond. O-L. However,
since from the outset the labels make some of the anagram solutions
readily available, the Trial effect is not significant for Cond. O-L.

Although couched in different language, the present model of the
category set agrees basically with that of Maltzman and Morrisett
(1952). The latter explain the set as follows: An anagram belonging to
a common class, e.g., “nature,” serves as a stimulus for the arousal of
other words belonging to the selected class and in addition, a whole
hierarchy of responses some of which belong to the selected class, some
of which do not. However, since only the selected category responses
are consistently reinforced in the experiment, these become dominant
through mediated generalization.

Maltzman and Morrisett do not explicitly describe the mediated
generalization mechanism. However, in the present model, the Ss’ pool
of associatively related words implicitly evoked by solved anagrams
could be regarded as the mediated generalization mechanism. This pool
of words would include the category name in so far as it is associated
to the words comprising a common class and elicits these words.

The present description of anagram solving in terms of verbal recall
and free association may be considered an extension of a model by Deese
(1959) who interprets free recall as involving in part free
association. Thus the present study represents an extension over which
associative relationships between words plays an important role.

The present research also implies that in future studies on the
category set, stimulus materials should be calibrated by existing
associative relationships for more precise predictions on the rate of
problem solving. For it is the contention of this study that
associative strength between words may be considered a measure of
existing response sets which when activated, facilitate solutions of
word problems such as anagrams.

SUMMARY

This experiment tested the hypothesis that the category set in anagram
solution operates by associative strength between words selected by E
as belonging to some common class. In Cond. O and O-L, groups of Ss
were given anagrams made from “organized” lists of words that elicited
each other in free association. In Cond. O-L, Ss were also told labels
describing the lists. The labels were associates to the words on the
list from which the anagrams were made and in turn elicited these
words. In Cond. R, Ss were given anagrams made from “random” word lists
(words with essentially zero probability of eliciting each other in
free association). Time to solution was evidence for the operation of
the set. It was predicted that (a) anagrams made from organized lists
would be solved more quickly than anagrams made from random lists; for
in the former, associations called up by the context would aid in
problem solution; (b) there would be a significant Trial effect (a
decrease in solution time with trials) for the organized condition as
associations called up by context began to aid Ss in the solution of
subsequent problems. The results verified these predictions and
supported the present interpretation of the category set. A description
of anagram solving as involving verbal recall and free association was
advanced to explain the results.

SEQUENTIAL INTERFERENCES DEMONSTRATED BY
SERIAL RECONSTRUCTIONS 1

E. B. COLEMAN

Human Resources Research Office


The findings of experiments on 
retroactive and proactive inhibition 
have been generalized to suggest that 
forgetting that occurs outside the 
laboratory is caused by interference 
similar to that found in the traditional 
laboratory experiment. Recently Un- 
derwood and Postman (1960) noted 
that ‘‘As plausible as this argument 
may seem, it is apparent that it would 
be highly desirable to specify possible 
sources of extraexperimental inter- 
ference, and to specify these in such 
a way that experimental tests of their 
influence on forgetting are possible” 
(p. 74). They suggested two such 
interferences: ‘‘letter-sequence inter- 
ferences” and ‘‘unit-sequence inter- 
ferences.” 


Letter-sequence interferences result when a list presented for learning
conflicts with English spelling habits. Suppose S is presented with a
consonant syllable JQB. Underwood and Postman assume that previously
learned spelling habits will make this sequence hard to learn because Q
never follows J; nor does B ever follow Q in English spelling. These
previously learned spelling habits must be “extinguished” or
“unlearned” or “inhibited” before JQB can be learned. Following Briggs
(1954), they also assume that as time passes, the older sequential
habits will spontaneously recover and interfere with JQB during a test
for retention (proactive inhibition). The older spelling sequences may
be used during the delay period also (common sequences being used more
frequently than uncommon ones). This will further strengthen them and
further interfere with JQB (retroactive inhibition). These assumptions
suggest that as time passes, S may replace JQB with a more common
sequence, perhaps JOB.

1 Based on a dissertation submitted to the Faculty of Philosophy of the
Johns Hopkins University in partial fulfillment of the requirements for
the degree of Doctor of Philosophy. The guidance of James Deese is most
gratefully acknowledged. This research was supported in part by a
National Science Foundation Fellowship (30125) and in part by a Public
Health Service Research Fellowship (MF-11,727).

The “unit” in unit-sequence inter- 
ferences is usually the word, although 
it could be any sequence of letters 
presented as an independent unit. 
Word-sequence interferences result 
when a sequence presented for learn- 
ing conflicts with syntactic habits. 
Suppose a list presented to S contains 
the sequence STAND DID LINGERIE 
THEY IN. Assumptions analogous to 
those given in the preceding para- 
graph predict that as time passes, S 
may replace such a sequence with a 
more familiar one—perhaps THEY DID 
STAND IN LINGERIE. 

To collect evidence for these two 
sources of interference, Underwood 
and Postman used serial anticipation 
learning and measured retention after 
1 week for four kinds of lists: common 
and uncommon words, and common 
and uncommon trigrams. As time 
passed, errors due to remembering an 
item in an incorrect order increased 
faster for word lists than for trigram 
lists. In a similar experiment, Postman (1961) also found more such
intralist errors in recalling a list of
common nouns than in recalling 
uncommon nouns. Both findings may 
be interpreted as evidence for word- 
sequence interferences. Underwood and 
Postman (1960) also found that as 
time passed, letter-sequence errors 
increased faster in the uncommon 
trigram lists than in the common 
trigram lists. This may be inter- 
preted as evidence for letter-sequence 
interferences. 

On the other hand, when the cri- 
terion is number of items recalled 
correctly, neither of the interferences 
has been revealed unless the analysis
was restricted to the middle serial 
positions—to items that had not been 
overlearned. Thus evidence for the 
effect of these two sources of extra- 
experimental interference is not as 
strong as one would wish. 

The most direct and convincing 
evidence for sequential interferences 
would be to demonstrate that uncom- 
mon sequences are replaced by more 
common ones during recall. This 
investigation will attempt to demon- 
strate this by a variation of serial 
reproductions which might be called 
serial reconstructions. 


PROCEDURE 


The Ss were 27 introductory psychology
students from the Johns Hopkins University.

Experiment I.—The first S was presented with a list of letters and
asked to memorize its order. Then he was given all the letters, each
one typed on a separate card, and was asked to arrange the cards in the
correct order. His ordering was typed and presented to the second S to
memorize and to arrange in order, and this second ordering was typed
and presented to the third S, and so on.

Eleven successive Ss were instructed to memorize and reconstruct the
order of six lists of scrambled letters. The Ss were tested
individually, one after the other, each one trying to memorize the six
orderings of the preceding S. The experiment was discontinued after the
eleventh ordering because the orderings appeared to have reached a
limit of no further approach toward English.

Six sentences ranging from 25 to 33 letters 
in length (counting spaces as letters) were 
selected from a second grade reader, and the 
individual letters in each sentence were 
scrambled. For instance, HE SAT DOWN might 
yield SHT EAW NDO. 
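
The scrambling step, together with the within-word trigram extraction that the Results section later uses to score the lists, can be sketched as follows. This is an illustrative reconstruction only: the function names are hypothetical, the shuffle treats spaces as ordinary characters (so a space may land at either end of the string), and the small frequency table stands in for the Underwood and Schulz Total Count with made-up values.

```python
import random

def scramble_letters(sentence, seed=None):
    """Shuffle every character of the sentence, spaces included."""
    rng = random.Random(seed)
    chars = list(sentence)
    rng.shuffle(chars)
    return "".join(chars)

def within_word_trigrams(text):
    """All three-letter sequences that do not span a space."""
    trigrams = []
    for word in text.split():
        for i in range(len(word) - 2):
            trigrams.append(word[i:i + 3])
    return trigrams

def mean_trigram_frequency(text, counts):
    """Mean count per trigram; trigrams missing from `counts` score 0 (an assumption)."""
    grams = within_word_trigrams(text)
    if not grams:
        return 0.0
    return sum(counts.get(g, 0) for g in grams) / len(grams)

scrambled = scramble_letters("HE SAT DOWN", seed=1)
print(scrambled)

# The worked example given in the Results: two-letter groups yield no trigram.
print(within_word_trigrams("IE TYOUW LHRH"))   # ['TYO', 'YOU', 'OUW', 'LHR', 'HRH']

# Made-up counts for illustration only, not values from any published table.
toy_counts = {"YOU": 5}
print(mean_trigram_frequency("IE TYOUW LHRH", toy_counts))
```

Scoring each successive ordering of a list with `mean_trigram_frequency` gives the kind of curve summarized in Fig. 1, provided a real trigram count table is supplied.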

The first S was given 20 sec. to study each list (by complete
presentation). Then he was given all the original letters of the list,
each letter typed on a separate card, and was asked to arrange them in
the correct order. Before he began to arrange them, he was required to
write down all he could recall. Because arranging the cards sometimes
took him as long as 10 min., as he arranged the cards, he refreshed his
memory by referring to these notes if he liked. The notes were
discarded, but his ordering of the letters was typed and presented to
the second S to memorize. The second S was given the same cards (after
they had been shuffled) and asked to arrange them in order, and this
second ordering was typed and presented to the third S, and so on.

With each new ordering, the list gradually 
approached the order of English spelling, so 
the time given to study a list was gradually 
reduced. As far as possible, E adjusted the 
time so that S made from three to six errors 
when ordering a list. By the eleventh order- 
ing, the time had been reduced to 12 sec. for a 
list. 

To reduce the serial position effect, the Ss
were told: “‘Most people remember the first 
and last portions of a list best. Since you 
must arrange all the cards, you will do better 
if you spend proportionately more of your 
time memorizing the middle of the list.” 

Experiment II.—For word sequences, Exp. I was repeated with the
following changes: (a) For materials, six sentences ranging from 18 to
24 words in length were selected from “Storyville Days and Nights” by
Louis Armstrong (1954) and the words in each sentence were scrambled.
(b) The first S was given 35 sec. to memorize a list, and by the final
ordering this had been reduced to 13 sec. (c) Sixteen Ss (or 16
orderings) were needed before the lists reached a limit of no further
approach toward English.


RESULTS 


Letter-sequence interferences.—Experiment I clearly showed that Ss
replaced uncommon letter sequences with more common ones. Although

TABLE 1

ORIGINAL LIST AND ELEVENTH ORDERING FOR FIVE LISTS OF SCRAMBLED LETTERS

Original List

(a) NEHSMFECR DI VARWER HOTIHTHF ER
(b) HNEWTWY NHETO NT HTKN AOSNDO
(c) WFD ALHOD HONIO NOOEK WDE ETTU
(d) TEFTLH EAGN HGRE PULA EHT
(e) EHH NC TEHEHM IRATDL IRWCHDE

they were asked to reconstruct the presented order exactly, with the
first four or five reconstructions at least, the lists came closer and
closer to English spelling.

Here are several orderings for a list selected at random:


Original: IE TYOUW LHRHES MNHKWEIH CNI ETAR
First: AI THYHLI MUHK WEIO EHRRWN ETCESN
Second: AI HYTWE WENKO HICH ERRHN MELTESN
Sixth: AI WEKNO HICH NERYHUE SWERT METHL
Eleventh: OWI HWE HEUI ACKRI NTHSRN MEYTHEL


The original and the eleventh 
orderings for the other five lists are 
given in Table 1. 

It may not be obvious that the final
orderings in Table 1 resemble English

FIG. 1. Mean trigram frequency for six lists of letters arranged in
order of successive orderings by Ss. (Ordinate: mean trigram occurrence
per 141,000 words; abscissa: successive orderings.)


Eleventh Ordering

(a) MERCHIV REAHER SFDT THON HFWI
(b) SWYTHE TKNOT ANDNO WEHN TOHN
(c) ALH KEETE WOUD HUOTN DOWD FOOI
(d) TEATHELD TUG NAHL EFH ERG
(e) CHH CHEEME HUDWAR DIRLT HEIT


more than the originals, but Fig. 1 shows quantitatively that Ss did
replace infrequent sequences with ones that more closely resembled
English spelling. In Fig. 1, mean frequency per trigram for all six
lists is plotted against successive orderings. For every list, all the
three-letter sequences were written down. Spaces were not considered
characters. (For instance, IE TYOUW LHRH yields TYO, YOU, OUW, LHR,
HRH.) The frequency with which each trigram occurred in 141,000 2
English words was determined (Underwood & Schulz, 1960, Total Count);
and mean frequency was plotted against successive orderings. For the
first four or five orderings at least, it is apparent that Ss replaced
presented sequences with sequences that more closely resembled English
spelling. Tau between frequency and ordering is .36 which is
significant beyond the .05 level.
Note that 6 of the 11 Ss were tested at
the horizontal part of the curve;
therefore this tau is a very conserva-

2 Underwood and Schulz (1960, p. 75) state that their Total Count
represents frequencies of occurrence in 1,035,000 words. However, B.
Underwood (personal communication) has indicated that this is in error.
In Appendix D, the Thorndike-Lorge (1944) count was based on a sample
of 2,080 of the 19,440 words in the The Teacher’s Word Book of 30,000
Words, and so represents a base closer to 106,000 than 1,000,000 words.
Thus, the Underwood and Schulz Total Count represents frequencies of
occurrence in approximately 141,000 words.





TABLE 2

ORIGINAL LIST AND SIXTEENTH ORDERING FOR FIVE LISTS OF SCRAMBLED
SENTENCES

(a) Original List: ABOUT WAS GOOD-LOOKING WAY AND TREATING MADE OF THAT A HIM THE QUIET YOUNGSTER NICE HE MANNERS A THEM GIRLS WILD GO WITH
    Sixteenth Ordering: HE WAS A YOUNGSTER NICE QUIET WITH MANNERS GOOD-LOOKING AND A WAY OF TREATING THEM THAT MADE THE GIRLS GO WILD ABOUT HIM

(b) Original List: AS BE CHILDHOOD TO LIVED FROM FRIENDS BEEN CONTINUE REAL HAD TRUE AS WE LONG WE AND WOULD WE
    Sixteenth Ordering: WE LIVED AS FRIENDS LONG TRUE FROM CHILDHOOD AND WE WOULD BE REAL TO CONTINUE AS WE HAD BEEN

(c) Original List: I BACK AND LONG AND SITTING ALL WORK DAY MULE MY MY IT SHOVELING GET BEHIND TO AWFUL USED PAINS COAL IN HARD WAS
    Sixteenth Ordering: I AND MY ALL DAY LONG SHOVELING COAL IT WAS HARD AWFUL WORK AND SITTING BEHIND MULE USED TO GET PAINS IN MY BACK

(d) Original List: GOOD WE BIG AND ME A COOKED IRENE GUMBO SEE OF HIM POT FOR HE AND CAME TO
    Sixteenth Ordering: GOOD CAME ME AND FOR HIM IRENE COOKED A BIG POT OF GUMBO TO SEE HE AND WE

(e) Original List: THEM HEART I WORLD OF MY FROM THE TO THE BOTTOM BE REPLACE WILL ABLE NEVER THAT SAY AND
    Sixteenth Ordering: NEVER SAY THAT WORLD AND BE MY HEART ABLE TO REPLACE THE I WILL FROM THE BOTTOM OF THEM


tive estimate of the correlation be- Several orderings for a list selected 

tween frequency and ordering.’ for its inherent interest are given: 
Figure 1 shows that after the fifth ~ 

ordering—after the sequences reached Original) STAND DID LINGERIE THEY IK 


f | 130 STORYVILLE THE DID AS NOT NEIGHBORHOOD 
¥ ( ~ITrence 7 ‘ 
a mean currence oO about ) FINE IN DOORWAYS SILK THEIR WEARING 


occurrences per 141,000 words GIRLS OUR IN. (First) STAND THEY DID IN 
there was little if any further approach STORYVILLE LINGERIE FINE THEIR THE DID 
toward English spelling. Although NOT IN SILK NEIGHBORBOOD AS DOORWAYS 
WEARING GIRLS OUR IN. (Second) STAND 
DID THEY STORYVILLE LINGERIE FINE THEIR 
IN SILK NEIGHBORHOOD OUR THE NOT GIRLS 
nonsense. This suggests that the Ss DID WEARING IN DOORWAYS IN AS. (Fourth 
deliberately tried to reproduce the FINE DID THEY STAND IN STORYVILL} 
same level of nonsense even when they LINGERIE IN NEIGHBOREOOD SLE OUR HO! 
GIRLS THE WEARING DID IN DOORWAYS AS 
THEIR. (Seventh) AS THEY DID STAND 
STORYVILLE LINGERIE IN SILK NEIGHBOR- 
periment I] repeated the above re- HOOD IN OUR DOORWAYS THEIR NOT WEAR- 
sults for word sequences: the Ss ING GIRLS FINE DID IN THE. (Eleventh 
AS THEY DID NOT STAND STORYVILLI 
LINGERIE IN THEIR FINE SILK DOORWAYS 
IN OUR NEIGHBORHOOD DID THE GIRLS 
new reconstruction, the arrangement WEARING IN. (Sixteenth) THEY DID NOT 
came closer and closer to a sensible STAND IN THEIR FINE SILK STORYVILLI 


the orderings changed somewhat, they 
remained at about the same level of 


had forgotten the exact sequence. 
Word-sequence _interferences.—Ex- 


replaced uncommon word sequences 
with more common ones. With each 


English sentence LINGERIE IN THE DOORWAYS WEARING AS 
Wnglisn s ’ 
OUR GIRLS DID IN NEIGHBORHOOD. 
3 These reconstructions were also scored 


as to bigram frequency per 121,000 words In Table 2, the original and the 
rhe curve was almost identical to Fig. 1. It 


' sixteenth orderings are given for the 
reached a pe ik at the fourth reconstruction, . ° ope, @ 

fell slightly, and began to rise again at the Other five lists. Even if it is not clear 
ninth reconstruction. from the examples that the sixteenth 
Fig. 2. Mean ranking by judges for six scrambled sentences arranged in order of successive orderings by Ss. (Ordinate: mean ranking by judges. Abscissa: successive orderings and successive subjects.)


orderings resemble English more than 
the originals, Fig. 2 shows quantita- 
tively that Ss did replace unfamiliar 
word sequences with more familiar 
ones. In Fig. 2 successive orderings 
are plotted against mean rank as to 
sensibleness. For each scrambled sentence, the original and the eight even-numbered orderings were typed on individual cards and shuffled, and 10 judges were asked to rank them as to their approximation to English.⁴ Tau between ordering and mean rank was .89, which is significant beyond the .0005 level.

Figure 2 suggests that the orderings 
reached a limit of no further approach 
toward English somewhat before the 
sixteenth ordering, and the examples 
show that the sixteenth ordering is 
still far from a sensible English
sentence. 
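
The two scoring steps used in these results, extracting trigrams from a letter list and correlating a score with ordering number by Kendall's tau, can be sketched in Python. This is only an illustrative sketch: the function names are ours, the tau shown is the simple tau-a with no correction for ties (an assumption, since the text does not say which variant was computed), and the values passed to `kendall_tau` are invented for the example rather than taken from Fig. 1 or Fig. 2.

```python
from itertools import combinations


def trigrams(sequence):
    """Return every three-letter sequence inside each word.

    Spaces are not treated as characters, so no trigram spans two words
    and a two-letter word contributes nothing.
    """
    result = []
    for word in sequence.split():
        for i in range(len(word) - 2):
            result.append(word[i:i + 3])
    return result


def kendall_tau(x, y):
    """Kendall's tau-a: (concordant pairs - discordant pairs) / all pairs."""
    score = 0
    pairs = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            score += 1      # pair ordered the same way on both variables
        elif product < 0:
            score -= 1      # pair ordered in opposite ways
        pairs += 1
    return score / pairs


# The example list from the text.
example = trigrams("IE TYOUW LHRH")

# Hypothetical scores for four successive orderings (not data from the study).
orderings = [0, 1, 2, 3]
mean_frequency = [1.0, 3.0, 2.0, 4.0]
tau = kendall_tau(orderings, mean_frequency)
```

Here `example` reproduces the five trigrams listed in the text for IE TYOUW LHRH. Looking up each trigram in a frequency count and averaging would give the ordinate of Fig. 1; that lookup is left out because the count itself is not reproduced here.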

DISCUSSION 


The fact that Ss replaced uncommon sequences with more common ones is a rather direct demonstration of the extraexperimental interference from language habits, and further discussion would add little of value; however, three procedural details are worth examining. They do not raise serious doubts about the main conclusions, but they may influence the shape of the functions in Fig. 1 and 2.

4 The judges were all graduate students familiar with approximations to English. Six coefficients of concordance were computed, one for their orderings of each sentence. Mean coefficient of concordance was .88, significant beyond the .001 level. Clearly the judges agreed among themselves.
First we should examine the consequences of E's attempt to hold errors constant by progressively reducing study time as the lists became progressively more simple. For the later Ss, this constant-error criterion forced errors on sequences that were almost correct English. Thus it raised the asymptote and reduced the negative acceleration somewhat. The more usual constant time limit produces a sharper negative acceleration: as soon as the list comes close to English, Ss are able to memorize it completely and the curve levels off.
Second, each S was exposed to and tested upon six successive lists totaling over 130 successive items. On the later lists, he was clearly subject to considerable proactive interference from the earlier sequences and from his own responses on earlier tests. Proactive interference from S's earlier lists should have no effect at all if we assume that there are no extraexperimental interferences from language habits. Interference from lists would cause him to replace one trigram (for instance) with another from an earlier list. Some replacements would be of more common occurrence in English, some of less common occurrence. If the replacements were consistently of more common occurrence, then this would argue for extraexperimental interference from language habits. Proactive interference from earlier tests would lead S to replace a trigram with a trigram from his own earlier reconstructions. This would raise mean trigram frequency because the mean trigram frequency of the reconstructions tended to be higher than that of the lists. Clearly this is a second-stage effect of extraexperimental interference from language habits.

Third, it should be emphasized that the test was not pure reconstruction: it was reconstruction confounded with recall because Ss were required to jot down their recall of the list and were allowed to consult these notes during reconstruction. The reconstruction technique was simply a stratagem to prevent the loss of elements that occurs in the usual serial reproduction technique.

In addition to the three procedural 
points, a marginal finding is worth 
examining—the finding that long before 
the reconstructions became sensible Eng- 
lish, they reached a limit of no further 
approach toward English. After this 
point, although the orderings changed 
somewhat, they remained at about the 
same level of nonsense. Apparently Ss 
were deliberately trying to reproduce the 
same level of nonsense. 

In fact, several Ss reconstructing
scrambled letters commented that when
they were unsure of a sequence, they would
be certain that it was not a real word.
For if it had been a real word, they said, 
they were sure they would have remem- 
bered it. Therefore in such a situation, 
they would purposely arrange the letters 
so that they did not spell a word. 

To check the fact of the asymptote, 
two lists of each kind were reconstructed 
10 more times by another group of Ss. 
The letter lists dropped from a mean of 128 occurrences per 141,000 words at the eleventh reconstruction to 124 at the twenty-first. In short the fourth reconstruction resembled English spelling about as closely as the twenty-first reconstruction.


SUMMARY 


Two extraexperimental sources of interference were examined: (a) letter-sequence interferences, which result when a list presented for learning conflicts with English spelling habits, and (b) word-sequence interferences, which result when a list presented for learning conflicts with the habits of English syntax. Both interferences were studied by serial reconstructions. The first S was exposed to a list and then he tried to reconstruct its order. Each succeeding S tried to reconstruct the order given by the preceding S. With each successive reconstruction, the scrambled lists came closer and closer to sensible English. However, the orderings reached an asymptote after which there was no further approach toward English.
THE EFFECT OF ORDER OF APPROXIMATION TO THE
STATISTICAL STRUCTURE OF ENGLISH ON THE
EMISSION OF VERBAL RESPONSES¹


KURT SALZINGER, STEPHANIE PORTNOY, AND RICHARD S. FELDMAN


Biometrics Research, New York State Department of Mental Hygiene and Columbia University 


The variables controlling the emission of verbal behavior may be roughly divided into response-produced stimuli (the speaker's verbal responses surrounding the "controlled" response) and external stimuli (audience, reinforcements, etc.). The present study deals with the effect of response-produced stimuli as specified by Miller and Selfridge's (1950) passages of different orders of approximation to the statistical structure of English.

These passages were originally constructed for the purpose of studying the effect of sequential association upon memory. It was hypothesized that the better memory for higher orders of approximation to English is related to the greater number of words determining each subsequent word. Many investigations have made use of these or similarly constructed passages as independent variables for such factors as memory (Deese & Kaufman, 1957; Marks & Jack, 1952; Miller & Selfridge, 1950; Richardson & Voss, 1960; Sharp, 1958), shadowing of one of two dichotic messages (Moray & Taylor, 1958), "meaningfulness" (Marks & Taylor, 1954), and eye-voice span (Lawson, 1961). Therefore it was thought worthwhile to obtain an exact measure of the amount of contextual dependency in each passage in order to test the assumption that consecutive orders are equally distant from each other. Obviously, if they are not, the shape of any curves employing this assumption in relating the above mentioned factors to approximation to English would be inaccurate.

1 This investigation was carried out while the third author held a predoctoral fellowship from the National Institute of Mental Health, United States Public Health Service. This study was supported in part by Research Grant M1541, in part by Research Grant MY 4758, from the National Institutes of Health. The authors wish to thank J. Zubin for his help and interest in this research. William Reynolds of Rutgers University deserves special thanks for making available a majority of the Ss. We appreciate the assistance of Hilda Brody of Columbia University in providing additional Ss, and of Robert Keisner and Phyllis Ziotogura for analysis of part of the data. This paper was presented in part at a meeting of the Eastern Psychological Association, Philadelphia, 1961.

The technique chosen for evaluating the amount of contextual dependency was the "cloze" procedure (Taylor, 1953, 1954, 1956), which was originated as a measure of readability and which consists of having Ss guess the words which have been systematically deleted from a given passage. Thus, application of the cloze procedure to these passages will, in addition to supplying an exact measure of contextual dependency, provide standards for comparing different texts, e.g., one text might be described to be as readable as third-order, another as readable as seventh-order approximation to English.

Finally, this study will provide the opportunity to examine the effect of order of approximation upon the number of guessed words which fall into the same grammatical category of speech as the "correct" word. This will make it possible to evaluate the contribution which the syntax, resulting from these approximations to English, makes to such functions as memory.


METHOD 


Passages.—The 50-word passages of Miller and Selfridge (1950), at each of the eight orders of statistical approximation to English (zero, first, second, third, fourth, fifth, seventh, and Text), were the basic material for this study. Each passage was mimeographed on a separate sheet of paper, each deleted word being indicated by an underlined blank of constant size. The order of presentation of the passages was randomized within each set of eight and varied from S to S.

Experimental groups.—Ninety-three undergraduate students, with a median age of 18.9 yr., were required to participate in this experiment as part of their regular course work. Group A₁ (17 Ss, 11 male and 6 female) and Group A₂ (16 male Ss) were given the passages with every fifth word deleted, starting with the fifth word of each passage and omitting the last word. Groups A₁ and A₂ gave essentially the same results on all measures, and therefore they have been combined into a single group (A₁A₂) in some of the analyses presented below. For all analyses of grammatical category, only Group A₂ is included since it was assumed that further analysis of Group A₁ would have yielded the same results.

Group B (33 male Ss) was given the same passages as Groups A₁ and A₂, with every fifth word deleted, starting with the sixth word of each passage and omitting the first word.

Group C (27 male Ss) was given the same passages with every seventh word deleted, starting with the seventh word of each passage and omitting the last two words.

Procedure.—The passages were administered to Ss in a group. They were told in the instructions read to them that the typescripts contained no punctuation and they themselves would have to decide where a new thought began, that only one word was called for in each blank, and that, when they were not certain, they should guess rather than skip a blank. Thirty minutes were allowed for completion of the task.
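
The deletion schemes for the three groups can be made concrete with a short Python sketch. It rests on one assumption about the wording of the Method: "omitting the last word" is read as leaving the final candidate position undeleted, which gives the six blanks per passage that the Results mention for the every-seventh-word group. The function names, the `drop_last` flag, and the placeholder words are ours, not the authors'.

```python
def blank_positions(n_words, step, start, drop_last):
    """1-based positions of the deleted words in a passage of n_words."""
    positions = list(range(start, n_words + 1, step))
    if drop_last and positions:
        positions = positions[:-1]   # leave the final candidate undeleted
    return positions


def make_cloze(words, positions, blank="_____"):
    """Return the passage with blanks inserted, plus the deleted words."""
    deleted = set(positions)
    text = [blank if i in deleted else w for i, w in enumerate(words, start=1)]
    answers = [words[p - 1] for p in positions]
    return text, answers


N_WORDS = 50
group_a = blank_positions(N_WORDS, step=5, start=5, drop_last=True)   # A1 and A2
group_b = blank_positions(N_WORDS, step=5, start=6, drop_last=False)
group_c = blank_positions(N_WORDS, step=7, start=7, drop_last=True)

# Share of the 50 words that is blanked for at least one group.
covered = set(group_a) | set(group_b) | set(group_c)
coverage = len(covered) / N_WORDS

# Hypothetical 50-word passage, used only to show the blanking itself.
words = [f"w{i}" for i in range(1, N_WORDS + 1)]
text_c, answers_c = make_cloze(words, group_c)
```

Under this reading, Groups A and B each have nine blanks per passage and Group C has six. Together the three schemes blank 22 of the 50 word positions, in line with the statement in the Results that the mean curve rests on about 40% of the words of each passage.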


RESULTS AND DISCUSSION 


Proportion of words correctly guessed.—Figure 1 shows that the proportion of correctly guessed words increased from Orders 0 through 7, with Text approximately equal to Order 5. Friedman analyses of variance (Siegel, 1956), used for this and all subsequent comparisons, showed that the three groups (A₁A₂, B, C) did not differ significantly from one another (P = .53) and that, within each group, the increase over orders was statistically significant (P < .001 for each group). Therefore, the mean curve in Fig. 1, based on N = 93 and on about 40% of the words of each passage, represents a more reliable summary of these data than each curve separately.

Fig. 1. Proportion of words guessed correctly as a function of order of approximation to English. (Legend: Av. Groups A₁ and A₂ (5th), Group B (5th), and Group C (7th), each P < .001 for diff. among orders; Average, P = .53 for diff. among groups.)

This result also makes clear that having 6 words on either side of the blank does not produce more correct guesses than having 4 words on either side of the blank, thus reinforcing the conclusion of Taylor (1954) and MacGinitie (1961) that the words predicted for every fifth blank are independent of each other, and the conclusion of Aborn, Rubenstein, and Sterling (1959) that 11-word sentences result in optimum predictability in comparison to 6-word and 25-word sentences. Apparently Ss either do not or cannot make use of a context of more than 5 words on either side of each blank.

The fact that the relationship between order and proportion of correct
TABLE 1

MEAN NUMBER AND PROPORTION OF EXACT CORRECT WORDS AS A FUNCTION OF
ORDER OF APPROXIMATION TO THE STATISTICAL STRUCTURE OF ENGLISH

Columns: Group; Score; Order of Approximation.
Rows: Number (a) and Proportion (b) for each group, followed by Mean Number and Mean Proportion.

Legible Number / Proportion entries: .03 / .003; .09 / .010; .29 / .032; .61 / .068; .21 / .023; 1.64 / .182; 1.39 / .154; 2.46 / .273.

a Number of correct words per S.
b Proportion of correct words to total number of words guessed.

words is not linear means that the 
assumption of equal distances between 
consecutive statistical orders is not 
tenable. If we utilize the values in 
Table 1 as correct estimates of the 
distances between orders of approxi- 
mation, then the distance between 
Orders 2 and 3 is more than five times 
the distance between Orders 1 and 2. 
For memory improvement on these 
same passages² (Selfridge, 1949), the
corresponding ratio is less than 1.5. 
Figure 2 shows the memory data 
plotted in two ways: (a) according 
to the assumption of equal intervals 
between orders of approximation, 
and (b) with orders spaced according 
to number of words correctly guessed 
in the present study. From Orders 3 
through 7, the curves are essentially 
the same. Up to Order 3, however,
the intervals between successive orders 
vary greatly in size, indicating that 

2 The set of passages used in the present 
study is available in Miller and Selfridge 
(1950). However, their memory data are 
based on averages of two passages at each 
order of approximation. The percentages 
of words correctly recalled, for the single set 
of passages used here, were obtained from 
Selfridge’s (1949) thesis, and correspond to 
her first set of lists. 


the equal-intervals assumption is least 
tenable in this region. One additional
fact is that Text gives rise to fewer
correct words than Order 7. This in- 
dicates that it was misplaced on the or- 
der continuum and also suggests that 
different texts might well appear in 
different positions on this continuum. 
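
The comparison of distances between orders amounts to simple arithmetic on the mean number of correct words at each order. The Python sketch below shows that arithmetic with made-up numbers: the values in `hypothetical_mean_correct` are illustrative placeholders, not the entries of Table 1, and the variable names are ours.

```python
# Hypothetical mean numbers of correct words per order.
# These are NOT the study's values; they only illustrate the computation.
hypothetical_mean_correct = {0: 0.0, 1: 0.2, 2: 0.5, 3: 2.0, 4: 2.6, 5: 3.1, 7: 3.8}

orders = sorted(hypothetical_mean_correct)

# Distance between each pair of consecutive orders on the cloze scale.
distances = {
    (a, b): hypothetical_mean_correct[b] - hypothetical_mean_correct[a]
    for a, b in zip(orders, orders[1:])
}

# Ratio of the Order 2-3 step to the Order 1-2 step.
ratio = distances[(2, 3)] / distances[(1, 2)]

# Abscissa positions when orders are spaced by cloze score instead of evenly.
spaced_positions = [hypothetical_mean_correct[o] for o in orders]
```

With equal intervals every entry of `distances` would be the same and `ratio` would be 1. Plotting memory scores against `spaced_positions` rather than against evenly spaced orders corresponds to the second way of plotting described for Fig. 2.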
Proportion of different words.—Figure 3 shows a plot of the proportion of the total number of guessed words which were different. With the exception of Text, which is again out of line, it is clear from this graph that increasing order of approximation not only increases the probability of evoking a correct response but also limits the variety of different responses. The difference among the three groups of Ss approaches significance (.05 < P < .08), but the graphs of the separate groups each show the general tendency to decrease over orders. The decrease is significant in each case (P < .008 for A₁A₂, P < .001 for B, P = .002 for C).

Figure 4 shows a plot of the number of different incorrect words as a proportion of the total number of incorrect words. With the exception of Text, there is a general decrease in the proportions with increasing order, although the overall drop is only about half of what it is in Fig. 3. The differences among the three groups approach significance (.05 < P < .08) as before, but here this is due to the fact that Group C does not show a significant decrease over orders (P = .20) while Groups A₁A₂ and B do show significant decreases (P < .05 and P < .04, respectively). This is the only measure on which one of the groups differed significantly from the others. A suggested explanation is that, with only six blanks for each passage in which every seventh word was deleted, the total number of guessed words may not have been large enough to allow significant changes in number of different words from order to order when only incorrect guesses are considered.

Fig. 2. Percentage of words correctly recalled (Selfridge, 1949) as a function of order of approximation to English, with an equal-interval scale for orders (dashed line, upper abscissa axis), and with the intervals determined by the average number of words guessed correctly in the present study (solid line, lower abscissa axis).
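
The two variety measures plotted in Fig. 3 and 4 are ratios of distinct responses to total responses. A minimal Python sketch follows, assuming the guesses being pooled are the responses of different Ss to one blank and that responses are compared after lowercasing; both assumptions, the function names, and the sample guesses are ours.

```python
def proportion_different(guesses):
    """Distinct guessed words divided by the total number of guesses."""
    normalized = [g.strip().lower() for g in guesses]
    return len(set(normalized)) / len(normalized)


def proportion_different_incorrect(guesses, correct_word):
    """Distinct incorrect guesses divided by the total number of incorrect guesses."""
    target = correct_word.strip().lower()
    wrong = [g.strip().lower() for g in guesses if g.strip().lower() != target]
    if not wrong:
        return None   # undefined when every guess was correct
    return len(set(wrong)) / len(wrong)


# Hypothetical responses of five Ss to one blank whose deleted word was "the".
sample = ["the", "a", "The", "dog", "the"]
all_words = proportion_different(sample)
incorrect_only = proportion_different_incorrect(sample, "the")
```

In this made-up sample three distinct words occur among five guesses, and both incorrect guesses differ from each other. With only a few incorrect guesses per blank the second ratio can take few distinct values, which is the kind of limitation the text suggests for Group C.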

Grammatical classification.—The grammatical classification was based on Fries' (1952) system of analysis. The two groups of words called general grammatical categories below are lexical words (roughly equivalent to the classical categories of nouns, pronouns, verbs, adjectives, and adverbs) and function words (roughly equivalent to articles, conjunctions, prepositions, auxiliary verbs, interjections, and quantity words). The five specific grammatical categories are nouns and pronouns considered as one class, verbs, adjectives, adverbs, and function words as already described.

Fig. 3. Number of different words including the correct word (expressed as a proportion of the total number of words guessed) as a function of order of approximation to English. (Legend: Av. Groups A₁ and A₂ (5th), P < .008 for diff. among orders; Group B (5th), P < .001; Group C (7th), P = .002; Average, .05 < P < .08 for diff. among groups.)

Figure 5 shows that, as order of 
approximation increases from Orders 0 
to 7, an increasing proportion of the 
words emitted by Ss belongs to the 
same specific grammatical category 
as the deleted words. The increase 
is significant for each group (P < .001
for A₂, B, and C), and the groups do
not differ from one another (P = .24).
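
The Friedman analysis of variance by ranks used for these comparisons can be sketched in Python as follows. This is a bare illustration of the test statistic only: it uses average ranks for ties but applies no tie correction, it does not compute the P value, and the small score table is hypothetical. For a test of the increase over orders within one group, each row would hold one S's scores on the eight passages.

```python
def average_ranks(values):
    """Ranks 1..k for one row, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for idx in order[i:j + 1]:
            ranks[idx] = shared
        i = j + 1
    return ranks


def friedman_chi_square(table):
    """Friedman statistic for table[s][c] = score of subject s in condition c."""
    n = len(table)        # number of subjects (rows)
    k = len(table[0])     # number of conditions (columns)
    rank_sums = [0.0] * k
    for row in table:
        for c, r in enumerate(average_ranks(row)):
            rank_sums[c] += r
    return 12 / (n * k * (k + 1)) * sum(r * r for r in rank_sums) - 3 * n * (k + 1)


# Hypothetical proportions for three Ss under three conditions,
# each S scoring higher at every successive condition.
scores = [[0.1, 0.2, 0.3], [0.0, 0.4, 0.5], [0.2, 0.3, 0.9]]
chi_square = friedman_chi_square(scores)
```

When every S orders the conditions identically, as in this made-up table, the statistic reaches its maximum of n(k − 1). The statistic is referred to the chi-square distribution with k − 1 degrees of freedom when n and k are not too small.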


Fig. 4. Number of different incorrect words (expressed as a proportion of the total number of incorrect words) as a function of order of approximation to English. (Legend: Av. Groups A₁ and A₂ (5th), P < .05 for diff. among orders; Group B (5th), P < .04; Group C (7th), P = .20; Average, .05 < P < .08 for diff. among groups.)

Fig. 5. Proportion of guessed words belonging to the same specific grammatical category as the correct word, as a function of order of approximation to English. (Legend: Group A₂ (5th), Group B (5th), and Group C (7th), each P < .001 for diff. among orders; Average, P = .24 for diff. among groups.)


This graph again shows performance on Text to be worse than at Order 7 and only slightly better than at Order 5. It should be noted that at Order 0 Ss perform very close to chance level, i.e., when the emitted words are put into the five specific grammatical categories, about one-fifth fall into the correct category.

Figure 5 differs from Fig. 1 (number of correct words) in that it seems to approach an asymptote at about Order 3 with only a relatively small increase after that, while the number of correct words rises as much after as before Order 3. It is also apparent that the maximum proportion of words in the correct grammatical category is more than twice as large as the proportion of exact correct words, i.e., while Ss may not react to the meaning they react to the syntax. This type of "knowledge" may well be used as a further way of distinguishing different texts.

Again comparing Fig. 5 and 1, it is seen that the increase from Orders 1 to 2 for words in the correct specific grammatical category accounts for a considerably greater proportion of the overall increase than does the corresponding change for exact correct words. Beyond Order 3, as already stated, correct specific grammatical classifications appear to approach an asymptote while number of exact correct words continues to increase. Therefore, if grammatical classification of guessed words is taken as a measure of amount of syntactical structure, and exact correct words are taken as a measure of amount of meaning, improvement in memory must be attributed primarily to increased syntactical structure between Orders 1 and 2, about equally to syntactical structure and to meaning between Orders 2 and 3, and primarily to meaning beyond Order 3.

If we examine the relationship be- 
tween order and specific grammatical 
category, taking into account only 
those responses which differed from 
the deleted words, it is found to be 
nearly identical to that shown in 
Fig. 5, where correct responses are 
included. The increases over orders
are significant (P < .001 for A₂, B,
and C) and the three groups do not
differ from one another (P > .15).

The relationship between order and general grammatical category also remains nearly the same whether or not correct responses are taken into consideration. Again at Order 0 Ss perform very close to chance level, i.e., when the emitted words are put into the two general grammatical categories of lexical and function words, without further subdivision, about one-half fall into the correct category. In general these relationships are very similar to the analogous ones for specific grammatical categories. Although the increases over orders cover a smaller range when the words are divided into only two categories, they are significant for all three groups (including correct responses: P < .002 for A₂, P < .001 for B and C; not including correct responses: P < .02 for A₂, P < .001 for B and C), and the groups do not


differ from one another (P = 
both cases). 


These results are in general agreement with Epstein's (1961) finding that learning of a syntactically structured series of words is superior to that of a random, unstructured series of words, since the two series correspond to Order 1 and Text of the present study. Epstein found in addition that there was no significant difference between his Ss' ability to learn syntactically structured nonsense syllable material and their ability to learn syntactically unstructured meaningful material, and this result demonstrates that syntax or meaning alone contributes to learning. This makes more plausible our suggestion that both "meaning" and syntax make contributions to memory for Miller and Selfridge's (1950) passages.


SUMMARY 


Ninety-three undergraduate students were required to guess the words that were systematically deleted from a series of passages varying in order of approximation to the statistical structure of English. The Ss guessed a greater proportion of words the higher the order of approximation to English. Proportion of words in the correct grammatical category also showed an increase with increasing order of approximation. Proportion of words in the correct grammatical category increased most from Order 0 to Order 3 while proportion of correct words continued to increase as much after Order 3 as before it. Thus analysis of the predicted words into their grammatical categories may well be useful for further differentiation of text materials, since the two types of analysis do not yield the same type of information. Finally, it was pointed out that the assumption of equal intervals between successive orders of approximation is untenable, and the relationship between memory and order of approximation to English can be explained in part by syntactical structure and in part by the "meaning" called for by the context.
THE EFFECTS OF REWARD AND KNOWLEDGE OF
RESULTS ON THE PERFORMANCE OF A SIMPLE
VIGILANCE TASK


RAYMOND R. SIPOWICZ, J. ... A. BAKER


United States Army Armor Human Research Unit, Fort Knox, Kentucky


Although the effects of reward and knowledge of results on human performance have been intensely studied, the effectiveness of such variables on human monitoring has received relatively little attention. In view of the practical importance of vigilance research and the need to sustain and maintain a high level of detection, this neglect is somewhat surprising. Pollack and Knaff (1958) studied the effects of reward and punishment on monitoring, and reported that punishment was more effective than reward. Moreover, the reward condition proved relatively ineffective in improving performance beyond a level obtained without reward.

Mackworth (1950); McCormack (1959); Loeb and Schmidt (1960); and Weidenfeller, Baker, and Ware (1962) did, however, obtain significant improvement in vigilance performance using knowledge of results (KR). In the last study, not only KR but "false KR" was also found to be effective. Knowledge of results in this experiment was given by means of a bright white light. While it might be assumed that such added stimulation would raise S's activation level and hence increase detection probability, the inclusion of a control group having the light alone (administered in synchrony with the typical vigilance decrement) showed no arousal effects. A similar finding has been reported by Davis, McCourt, and Solomon (1960).

The effectiveness of KR in these studies, and the apparent failure of "reward" in the Pollack and Knaff (1958) study, is somewhat contradictory, especially in view of the fact that feedback or KR can be, and often is, logically regarded as a form of reward or reinforcement. In the Pollack and Knaff study no mention was made of the amount of reward except to say that it was an extra hour of pay. Further, the conditions of reward were such as to make it difficult for more than 1 or 2 Ss in the group to be rewarded.

If, however, in the typical vigilance study the reward is contingent upon the maintenance of a certain level of vigilance by each S, the effects of the reward might be more pronounced than in a situation where a reward is given for each signal detected or on a competitive basis where only the best are rewarded. Moreover, if reward and KR are individually effective, a condition combining the two might be doubly effective.

In order to furnish answers to some of these questions, the effects of KR, reward, and the combination of the two in a typical vigilance task were studied under the conditions in which a monetary reward was individually administered and was contingent upon the maintenance of a high level of performance.


METHOD 


Subjects.—Eighty Fort Knox armor trainees, aged 17 to 24 yr., free from visual defects, served as Ss. Twenty Ss were arbitrarily assigned to Group C (Control), 20 to Group KR, 20 to Group R (Reward), and 20 to a combined KR and reward group (Group R + KR).

Apparatus.—The Ss' task was to detect aperiodic interruptions of a continuous light source over a 3-hr. period. The light source (12 v. dc bulb, .45 ft-c as a point source, operating at 54 v.) was located at approximately eye level in a flat black plywood box. The schedule of interruptions (signals) was predetermined on the basis of 12 signals per ½ hr. with a total of 72 for the 3-hr. session. The randomized intersignal intervals ranged from 24 to 360 sec. with an average interval of 150 sec. The schedule was presented by means of a film-tape, fed through a Gerbrands variable-interval programer, and a simple timing circuit. The latter determined the duration of the interruption (.03 sec. measured at the timing relay with a Hunter Klockounter), and the former determined the intervals between the interruptions. The signal presentations and Ss' signal detections were recorded by a 20-pen Esterline-Angus operations recorder.


Procedure.—Each S, wearing earphones to reduce ambient noise, monitored the light in an isolated room. For the no-KR condition, S was told to press the response button as soon as he saw a signal. Each S was further told that the signals could occur at any time, so it was necessary to remain alert and watch the light at all times. To insure that S understood the requirements, a practice period of 10 signals (at 1-min. intervals) was given before the watch session began.

For the KR conditions, a 14-in. pilot lamp was installed at approximately eye level on the right side of the monitoring display. The S’s requirements were the same as before; S was told that the pilot lamp would light if he missed a signal, thus indicating a signal was missed and that he should be more alert in order to detect subsequent signals. This light, unlike the signal light, was covered by a white filter lens and operated at its maximum voltage, 6 v. dc. The light was flashed for a 2-sec. period if a given signal was not detected within 5 sec. after its presentation.

For the reward conditions, each S was told he would receive $3.00 if he detected all the signals. In addition, for each signal missed S was told that he would lose a portion of the initial $3.00 in geometric progression beginning with 5 cents for the first miss. By missing six or more signals during the watch session, S lost the entire amount. Thus, if S missed only one signal in the 3-hr. period he received $2.95; for two misses, he received $2.85; for three, $2.65; for four, $2.25; for five, $1.45; and with six or more misses, nothing. As in the KR condition, a miss was scored if S failed to report a signal within 5 sec. after its appearance.
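
The payoff rule for the reward conditions can be checked with a short sketch. The function name `reward_cents` and its parameters are illustrative and do not appear in the original report; the sketch assumes that each successive miss doubles the penalty, starting at 5 cents, and that the amount retained cannot fall below zero.

```python
def reward_cents(misses, initial=300, first_penalty=5):
    """Return the reward (in cents) retained after a given number of misses."""
    if misses < 0:
        raise ValueError("misses must be non-negative")
    # Assumed penalties: 5, 10, 20, 40, 80, ... cents (geometric progression).
    # The sum of the first n terms is first_penalty * (2**n - 1).
    total_lost = first_penalty * (2 ** misses - 1)
    # The retained amount is floored at zero (assumption for six or more misses).
    return max(0, initial - total_lost)


# Amount retained for 0 through 6 misses, in cents.
schedule = [reward_cents(m) for m in range(7)]
```

Under this assumed rule the sixth penalty (160 cents) exceeds the 145 cents remaining after five misses, which is consistent with the statement that six or more misses forfeited the entire $3.00.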

Although the study was designed for 20 Ss per group, several Ss fell asleep during the watch session. Since Ss could not be directly observed, it was arbitrarily decided to eliminate from the study all Ss who missed 12 or more consecutive signals. This was done on the assumption that Ss missing 12 or more consecutive signals were either asleep or uncooperative. On this basis, 5 of the original 80 Ss were rejected: 2 from Group C, 1 from Group KR, and 2 from Group R. None of the 20 Ss, however, were eliminated from Group R + KR. In an attempt to equalize the number of Ss in each group, additional sessions were held. Due to experimental difficulties, however, data were obtained on only 19 Ss in Group KR.
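
The exclusion criterion (12 or more consecutive missed signals) amounts to a longest-run check over each S's detection record. The following is a minimal sketch; the function names and the boolean record format are hypothetical and are not taken from the original study.

```python
def longest_miss_run(detected):
    """Length of the longest run of consecutive misses.

    `detected` is a sequence of booleans, one per signal in presentation
    order: True for a detection, False for a miss (assumed format).
    """
    longest = current = 0
    for hit in detected:
        current = 0 if hit else current + 1
        longest = max(longest, current)
    return longest


def should_exclude(detected, threshold=12):
    """Apply the stated rule: reject S after 12 or more consecutive misses."""
    return longest_miss_run(detected) >= threshold


# Hypothetical 72-signal record with one run of exactly 12 misses.
example_record = [True] * 30 + [False] * 12 + [True] * 30
```

A record with many scattered misses but no run of 12 would be retained under this rule, which matches the stated intent of removing only Ss presumed asleep or uncooperative.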


RESULTS 


The mean percentage of signals missed by each group, by 30-min. periods, is shown in Table 1. On an overall basis, Group C missed 24.3% of the signals; Group KR, 12%; Group R, 8.4%; and Group R + KR, 4.3%. The raw data in terms of number of signals missed by the groups were analyzed to obtain means and variances. A test for homogeneity of variance was performed on these data and resulted in a significant F value. The data were transformed to a log scale and then subjected to an analysis of variance.

TABLE 1

MEAN PERCENTAGES OF SIGNALS MISSED FOR ALL GROUPS OVER SIX 30-MIN. PERIODS AND OVER ENTIRE 3-HR. SESSION

Group    Successive 30-Min. Periods


This analysis showed the groups were significantly different (F = 20.92, df = 3, P < .01). Using Duncan’s (1955) multiple range test, all the experimental groups were found to differ significantly from the control (P < .01). The mean difference between Group R + KR and Group KR was also significant at the .01 level. The difference between Group R + KR and Group R was, however, significant at only the .05 level. No significant difference was found between Group R and Group KR.


An inspection of the performance curves also revealed the typical vigilance decrement within the first hour for Group C. This expected decrement, however, was not shown by any of the experimental groups. For the experimental groups, the greatest decrement occurred for Group KR between the first and fourth 30-min. periods. A correlated t test for the decrement, however, showed the difference was nonsignificant.


Although it might be argued that knowledge of results should lead to an increase in the number of false responses, an analysis of the present data showed no significant differences between groups. The Ss in Group KR averaged 1.05 false responses as compared with 0.84 for Group C, 0.72 for Group R, and 0.25 for Group R + KR.


R. WARE, AND R. A. 


BAKER 


DISCUSSION 


Of most importance was the finding that the combination of reward and knowledge of results led to the detection of better than 95% of the signals, a performance at a “near perfect” level. Further, the extremely small variance of this group may be some indication of the extent to which individual differences in performance can be decreased by the use of such motivational techniques. Classification of Ss into “good performers” and “bad performers” as was done in other studies (Buckner, Harabedian, & McGrath, 1960; Fraser, 1953) may be adequate in vigilance tasks in which no attempt is made to sustain a high level of motivation. Results from this study, however, suggest the performance of all Ss in monitoring tasks may be sustained at a high level. Thus, the problem of selection and assignment of Ss to vigilance tasks on the basis of their initial ability can be eliminated if effective motivational techniques are discovered and properly employed.

Of additional interest is the fact that reward alone proved to be only slightly more effective than KR. It should be noted, however, that in the reward situation S had no way of knowing whether or not a signal was missed and, consequently, was unable to adjust effectively his monitoring behavior to meet the requirements of the task.


The relatively high level of performance, as well as the failure to find the usual vigilance decrement for the two reward conditions, focuses considerable attention upon the method used in dispensing the reward. First of all, Ss did not have to detect all of the signals in order to be rewarded. The S, however, was required to detect at least 93% of the signals. Moreover, Ss were told before the session began that they would be given the reward. Their subsequent performance determined whether or not it would be retained. This technique, it is believed, was particularly effective in sustaining a high level of motivation over the entire watch period. When specific knowledge of the reward situation was then made available throughout the watch session, S’s motivation was further increased and an even higher level of performance was obtained.

In previous studies of KR such as Mackworth’s (1950), Ss were informed whenever they made a false response, when they failed to detect a given signal, and when they correctly detected a signal. It is likely, however, that much of this information is superfluous. Knowledge of results as used in this study gave Ss information only when they failed to detect a given signal. In terms of the mean number of signals missed, this technique was as effective as those used by Mackworth. Mackworth’s experimental group missed 17.3% of the signals presented in a 2-hr. session, whereas Ss in the present study missed only 12% over a 3-hr. period.


SUMMARY 


Four groups of Ss (20 a group) monitored aperiodic and brief interruptions of a continuous light source under isolated conditions for a 3-hr. period. The Ss in Group R were given $3.00 if they detected all signals presented during the watch session, but lost .05, .15, .35, .75, 1.55, or all if they missed one, two, three, four, five, or six signals. Group KR was informed of all signals missed by a bright flash of light. Group R + KR received both KR and reward according to the schedule for Group R. Group C, a control, received neither reward nor KR.

Although all experimental groups were significantly better than the control group, the combination of reward and KR produced the highest level of signal detection. The results are interpreted as indicating that either reward or KR can be effective in maintaining a high level of vigilance. The effectiveness of reward, however, is highly dependent upon the manner in which it is used. The effectiveness of such incentives in improving performance and reducing inter-S variability also attenuated the importance previously assigned to individual differences. It is further suggested that such differences are primarily motivational and, as such, are susceptible to experimental manipulation and control.
SOME CONDITIONS INFLUENCING THE ACQUISITION
AND UTILIZATION OF CUES¹


LOY S. BRALEY²


University of Buffalo 


A paper by Bruner, Matter, and Papanek (1955) posed the question as to the variables that control the range of environmental information that an organism acquires during some behavioral sequence. This problem of the “breadth of learning” assumes that an organism is exposed to a wide range of possible cues under most circumstances, but that responses are acquired to only a portion of these with the others remaining as undifferentiated context. While it is clear that the range of cues acquired may be differentially advantageous depending on the task requirements, the focus of their paper and the present study was on the acquisition of cues that were not immediately task-relevant, but which became so at a subsequent time. The above authors demonstrated, as conjectured by Tolman (1948), a reduction in subsequent utilization of initially irrelevant cues as a product of overlearning and higher levels of motivation.

This was but a start on an empirical question. Recent reviews by Easterbrook (1959) and Kausler and Trapp (1960) on cue utilization demonstrate the continued interest in this problem. Easterbrook, concerned with the relation of emotion to behavioral organization, took the position that an increase in emotional arousal functions to reduce the utilization of cues. Consequently, degree of behavioral organization depends on task demands for “breadth” of cues. Kausler and Trapp were in substantial agreement although their focus was on the effects of motivation on the utilization of cues in incidental learning. They suggested additional relevant variables of motivation as generalized D versus incentive-oriented set, spatial contiguity of relevant and irrelevant cues, and task difficulty.

The present study was designed (a) to demonstrate with human Ss, as Bruner et al. (1955) have done with rats, that exposure to initially irrelevant cues will facilitate later learning when these cues become the relevant discriminanda, (b) to test the hypothesis that high scores on a response-inferred motivational variable (Taylor MA scale) are inversely related to acquisition and utilization of such cues, and (c) to determine whether the presence of the previously correct cue from earlier learning would differentially facilitate or retard the shift to new cues as a function of low and high motive strength, respectively. Certain clinical conceptions of anxiety (Cameron, 1951; Sullivan, 1953) would predict such a perseverative tendency.

¹ This article is based on a PhD dissertation submitted to the Faculty of the Graduate School of Arts and Sciences of the University of Buffalo. The author is greatly indebted to Ira S. Cohen, under whose guidance this study was conducted, and to Walter Cohen for many valuable suggestions throughout all phases of the investigation.

² Now at University of California, Santa Barbara.

METHOD 


Subjects.—Eighty extreme scorers on the MA scale were selected as Ss from a sample of 525 undergraduate questionnaires obtained at the University of Buffalo and Springfield College. The upper 20% of this sample had an MA score range of 20 to 40, with a mean of 25.3; the lower 20% a range of 0 to 7, with a mean of 4.59. The mean of all scores was 13.77. Twenty-four Ss were female, 13 of whom were high MA scale scorers. All Ss were contacted and asked to participate in such a manner as to avoid any association between the MA scale and the experiment proper.


Because of contradictory evidence (Taylor, 
1955) on the relationship of IQ to MA score, 
a short form of the Wechsler-Bellevue, Form 
I (Herring, 1952) was administered. The 
correlation between IQ and MA was .05. 
Learning rate on the experimental task and 
IQ revealed a nonsignificant r of —.22. 

Task and materials.—The basic stimulus task was one of concept learning with simultaneous presentation of a positive and negative instance on each trial. This is similar to the Bruner et al. (1955) procedure and that of Blum and Blum (1949), and Lashley (1942) in their contributions to the continuity controversy. Each instance consisted of two small geometric figures inside a larger geometric figure, drawn on a white 4 × 6 in. card. The three figures on any card were combinations of circles, triangles, and squares each of which could be red, blue, green, or yellow. An instance then, might be a small red circle, small green triangle, within a large blue square. The stimuli for the three stages of the experimental procedure were as follows:


Stage I: All Ss learned a concept where the positive cue was two small like-colored figures within a large figure of a different color. Only 12 positives were used as pretesting had indicated rapid learning for this simple concept. Since only two colors appeared on these positives, 12 combinations were possible. This also permitted appropriate combinations of the three types of figures.
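
The count of 12 positive color combinations can be reproduced by enumerating ordered pairs of distinct colors. This is an illustrative sketch only; it assumes that one color is shared by the two small figures and a different color is used for the large figure, and the dictionary keys are hypothetical names.

```python
from itertools import permutations

COLORS = ("red", "blue", "green", "yellow")

# Each positive instance pairs one color for the two small figures with a
# different color for the large figure, so the order of the pair matters.
color_pairs = [
    {"small_figures": small, "large_figure": large}
    for small, large in permutations(COLORS, 2)
]

n_combinations = len(color_pairs)  # 4 colors x 3 remaining colors
```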


The same negative instances were used in all three stages so the possibility of memorization had to be obviated. The 30 negative instances were combinations of 4 values of color, 3 of shape, and 2 of size (small or large figure). In addition, 6 negatives with all figures of the same color were included to avoid concept attainment by a different-colors cue.

Stage II: While all Ss continued to be reinforced for the Stage I positive cue, half of the Ss were exposed to 10 trials of irrelevant cues. Five of these consisted of cross-hatching in the large figure and 5 consisted of the substitution of an L shape for one of the small figures. All new irrelevant cues appeared on Stage I positive instances.


Stage III: The positive cues were now 
either the presence of a cross-hatched large 
figure or an L shaped small figure. The 30 
positive instances were identical to the nega- 
tives except for the addition of cross-hatching 
on half, or substitution of a small L shape on 
the remainder. No two instances that were 
the same in original form were opposed during 
exposure to S. Half of the Ss in Stage III had 
the Stage I positive cue present as a negative 
instance on 50% of the trials. For this 
condition a set of 20 negatives was used 
consisting of the last 10 Stage I positives and 
the last 10 of the regular negatives. 

Design and procedure.—The experiment was a 2 × 2 × 2 factorial design with 10 Ss per cell. Conditions were: (a) high and low anxiety, (b) presence or absence of irrelevant cues in Stage II, and (c) presence or absence of Stage I positive cue during Stage III.

The Ss were randomly assigned to one of the four experimental conditions according to anxiety grouping and were run individually. The IQ measure was obtained at the conclusion of the experimental task.
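
The cell structure of the design can be laid out explicitly. The sketch below is illustrative; the factor labels are hypothetical names chosen here, and only the factor levels and the 10 Ss per cell come from the text.

```python
from itertools import product

# Three two-level factors, as listed under conditions (a), (b), and (c).
FACTORS = {
    "anxiety": ("high", "low"),
    "stage2_irrelevant_cues": ("present", "absent"),
    "stage3_stage1_positive_cue": ("present", "absent"),
}
SS_PER_CELL = 10

# One dictionary per cell of the 2 x 2 x 2 design.
cells = [dict(zip(FACTORS, levels)) for levels in product(*FACTORS.values())]
total_ss = len(cells) * SS_PER_CELL

# Within each anxiety group there are four experimental conditions.
conditions_per_anxiety_group = len(cells) // len(FACTORS["anxiety"])
```

The eight cells of 10 Ss account for the 80 Ss selected, and the four conditions within each anxiety group correspond to the "four experimental conditions" to which Ss were randomly assigned.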

The S faced a plywood screen with two exposure windows at approximately eye level. With the simultaneous presentation of two cards on each trial S was to indicate his choice of the correct card by depressing one of two response keys fixed to the table immediately below the windows. A correct choice closed a circuit and reinforcement was delivered immediately by a small white bulb recessed between and above the windows. Position of the correct card was varied according to a prearranged randomly chosen order. The Ss were instructed that their job was to choose the correct one of two cards and that they would have to begin by guessing. However, the reinforcing stimulus would indicate when they were correct in their choice so that eventually they were to reach a point where they could choose the correct card each time.

In Stage I, trials were continued to a criterion of 10 successive correct responses. Eight Ss were discarded for failure to meet criterion by Trial 70. Stage II began with completion of the criterion trials and was, to S, continuous with Stage I in that there was no interruption in presentation of the stimulus materials. The experimental group was exposed to the series of 10 irrelevant cues. The control Ss were simply continued for 10 trials beyond criterion on the Stage I cue, giving an equal number of overlearning trials to both groups. Stage III was again continuous with Stage II, and reinforcement could be achieved only by responding to either of the previously irrelevant cues.

TABLE 1

MEANS AND SDs OF TRANSFORMED TRIALS TO CRITERION FOR CONCEPT LEARNING: STAGE III

Anxiety    Positive Cue from Stage I    Irrelevant Cues Present in Stage II (Mean, SD)    Irrelevant Cues Absent in Stage II (Mean, SD)
High       Present
High       Absent
Low        Present
Low        Absent


RESULTS 


There was no substantial difference in Stage I acquisition trials between control Ss (Mean = 10.20, SD = 11.60) and those exposed to the irrelevant cues in Stage II (Mean = 11.51, SD = 12.92). While not shown, it might also be noted that neither experimental nor control Ss made any errors during the 10 Stage II trials.


Table 1 presents the transformed 
means and SDs of the eight experi- 
mental conditions. A Bartlett test 
revealed nonhomogeneous variances 
among the conditions, necessitating 
a Square-root transformation to meet 
the requirements of the analysis of 
variance. 

Table 2 indicates a significant F for the Stage II irrelevant cue exposure condition. However, rather than facilitating performance in Stage III, exposure to the irrelevant cues impaired performance. It appears then that the facilitation of subsequent learning through utilization of previously irrelevant cues, as found by Bruner et al. (1955), cannot be demonstrated with human Ss under these experimental conditions.

It is also clear from Table 2 that differences in the motivational condition (MA scale) bear no relationship to performance in the Stage III concept learning. Further, an analysis of variance of Stage I cue learning and MA score (within MS larger than between MS for df = 7/72) was not significant.

For the third condition the analysis of variance indicates that continuation of the Stage I positive cue into Stage III facilitated performance independent of MA scale level. Difficulty in shifting from a successful (positively reinforced) cue to a new one while the old one is still present was no more characteristic of the high-MA Ss than of the low-MA Ss.

With two cues required for concept attainment in Stage III it is of some interest to determine differential cue difficulty. Error scores were computed for the two cues in the Stage III task and an analysis of variance run on the irrelevant cue exposure condition and the type of cue (cross-hatched ground vs. L shaped figure). A significant F (F = 10.62, df = 1/156) emerged for the irrelevant cue condition, confirming the analysis by trials, as well as for the type of cue (F = 8.88, df = 1/156). The cross-hatched cue covering the entire background of the large figure was clearly an easier cue. With no procedure available for scaling the difficulty of cues it seems reasonable to assume that this was a more novel cue generating higher attention value.

TABLE 2

ANALYSIS OF VARIANCE OF TRANSFORMED TRIALS TO CRITERION FOR CONCEPT LEARNING: STAGE III

Source
Anxiety (A)
Irrelevant cue exposure (IC)
Stage I positive cue present (PC)
A × IC
A × PC
IC × PC
A × IC × PC
Within

* p < .01


DISCUSSION 


The failure of the motivational variable to predict differential acquisition and utilization of cues deserves little comment. Strictly speaking, these results are applicable only to this task, although they raise further questions about the status of the MA scale as an independent selection device.

While several studies (Bruner et al., 1955; Jeeves & North, 1956; Peterson & Peterson, 1957; Weiss & Margolius, 1954) have demonstrated positive transfer of context or irrelevant stimuli, the present results suggest additional factors that demand attention. Here, exposure to initially irrelevant cues functioned so as to inhibit rather than facilitate utilization of these cues when subsequently they became the basis for correct discrimination. Berlyne (1958) and Dember (1960) have presented data showing the preference for, and attention value of, stimulus change and novelty. Luria (1957) and Sokolov (1954) have found that orienting reactions are induced by changes in the stimulus field that serve to increase the accessibility of the organism to deal with these changes. These orienting responses are maintained or extinguished as a function of the relevance of new or changed stimuli. Consequently, in the present study, despite the similarity of immediate negative reinforcement at the start of Stage III, the stimuli constituting the positive instances were novel and attention-compelling for the control Ss while for the experimental group they represented the familiar. Both the fact that 53% of the control Ss reached criterion in the first 10 trials as opposed to only 18% of the experimental group, and the greater ease of learning the cross-hatched ground cue are subject to the same interpretation. It might also be noted that the greater ease in learning the Stage III task where the positive Stage I cue was present as a negative instance fits well with this interpretation. This familiar cue constituted a clearly defined negative instance 50% of the time and thereby served to orient S to the other card which was now a positive instance. Had the primary cue been randomly associated with either positive or negative instance in Stage III it would not have had the effect of increasing novelty and, therefore, discriminability of the positive instance.

Two further task variables are probably crucial to the evaluation of the effects of the acquisition of initially irrelevant information. One is that positive transfer of previously irrelevant cues may occur (Babb, 1956; Bruner et al., 1955; Lawrence, 1949, 1950) only where the irrelevant stimuli are not competing in the same sensory field, e.g., a main visual task with irrelevant auditory cues. Different sensory channels would seem to reduce cue interference and the necessity of “adapting out” (Restle, 1955) cues. The second factor is that where the irrelevant cues are randomly present throughout a Stage I task, and then systematically related to the positive cue in Stage II only for the experimental group, then the novelty advantage of these cues for the control group in Stage III should be nullified.


SUMMARY 


A three-stage concept-learning task was used to investigate the effects on subsequent cue utilization of prior exposure to relevant cues, differences in level of motivation (MA scale), and the shift of a previously positive cue to the status of a negative instance 50% of the time. The experiment was in the form of a 2 × 2 × 2 factorial design with 10 Ss per cell.

The results showed no differences on any aspect of the concept-learning task as a function of level of motivation. The introduction of additional (but irrelevant) cues, after an initial discrimination has been learned, functions so as to inhibit rather than facilitate the utilization of these cues when they later become the basis for correct discrimination. A previously positive cue retained in a later learning task, where its reinforcement value is negative, facilitates the learning of a new discrimination. Results were discussed as suggesting an interpretation in terms of stimulus novelty.
EFFECT OF STIMULUS CONDITION AND REACTION TIME
INFORMATION ON SPATIAL STIMULUS GENERALIZATION¹


CHARLES Y. NAKAMURA AND JAQUES W. KASWAN


University of California, Los Angeles


Instructed voluntary response (VR) procedures for studying spatial generalization (Brown, Bilodeau, & Baron, 1951) have recently received much attention. But as noted by Sherman and Knopf (1960) little has been done to define the variables that determine error frequency, the measure of stimulus generalization (SG) in this procedure. Although there have been some attempts to generalize formulations derived from classical conditioning situations to the VR situation, there is little agreement between results obtained from the two procedures. One reason proposed for this is that conditioning studies of SG were in large part concerned with primary stimulus generalization, whereas the VR studies must give greater consideration to mediated generalization variables (Gibson, 1959; Mednick & Freedman, 1960).

The concept of stimulus categorization appears to be a useful description of mediational processes involved in VR SG. Used this way categorization involves the assumption that if stimuli to which Ss are instructed to respond can be discriminated and conceptualized as belonging to a common class, this will facilitate categorization of these stimuli. Where categorization is so facilitated, generalization to other stimuli in the same continuum will be less likely to occur than when categorization is more difficult. The reasoning is similar to Wallach’s (1958) discussion of categorization as a determinant in SG-like behavior, where he implied that ease of discrimination of stimuli is directly related to categorizability and inversely related to SG. A recent study by Evans (1961) provides direct evidence that spatial stimulus generalization is a function of whether Ss can perceive the relative spatial location of the training and test stimuli.

¹ This research was supported by Research Grant G-9589 of the National Science Foundation, and by grants from the Research Committee of the University of California, Los Angeles. The authors wish to acknowledge the contributions of Marion Schulman, who assisted in the collection of the data, and Selma J. Brotsky, who assisted both in the collection and the analysis of data. Parts of this article were presented in a paper read at the 1961 American Psychological Association convention in New York City.



A major goal of the present study is to attempt the identification of some stimulus and response variables necessary to specify the role of categorization in VR spatial SG. A brief description of the experimental situation will assist in making the following discussion of hypotheses more explicit. The stimuli are a horizontal row of 11 lights at eye level on a circular panel presented under three different stimulus conditions. In Cond. 5-6-7, Ss are instructed to press a key only when Lights 5, 6, or 7 (positive lights) come on. In Cond. 3-6-9, they are to respond only to Lights 3, 6, or 9. In the third condition, essentially a control group, only the center Light 6 is positive.

Because of the spatial contiguity of Lights 5-6-7, Ss are expected to identify each as belonging to a group of three, thus facilitating categorization of these lights as positive. No such grouping is likely to be discerned in Lights 3-6-9 so that categorization of these lights should be more difficult. Johnsgard (1957), in a study of stimulus-background contrast, presented a similar explanation of SG effects.

This arrangement of training and test 
stimuli also permits the comparison of 
error frequencies predicted by the cate- 
gorization hypothesis and those predicted 
by Hull’s gradient summation hypothesis 
based on the exponential summation 
of overlapping gradients of generalized 
habit strength. Bilodeau, Brown, and 
Meryman (1956) reported support for 
the latter hypothesis. Kalish and Gutt- 
man (1957), on the other hand, found 
only partial support for the Hullian 
hypothesis both in their review of the 
Bilodeau et al. (1956) results and in their 
own research. Under the conditions 
of the present experiment, the gradient 
summation hypothesis would predict 
approximately the same number of 
errors to Lights 4 and 8 in both Cond. 
5-6-7 and 3-6-9. (Lights 4 and 8 were 
selected for specific comparison of errors 
because in both stimulus conditions they 
are located ideally in the spatial con- 
tinuum where there are at each negative 
light three overlapping gradients gen- 
erated from the training lights.) If, 
however, spatial contiguity facilitates 
categorization, fewer errors to these two 
lights would be expected in 5-6-7 than 
in 3-6-9. Moreover, in accord with the 
categorization hypothesis that the three 
positive lights in 5-6-7 are conceptualized 
as belonging to a single class, both the 
total number of errors and the error 
gradients in 5-6-7 are expected to be 
similar to those obtained in Stimulus 
Condition 6 where only the single light 
Number 6 is positive. Such a prediction 
would not be made from the gradient 
summation hypothesis. 

Specific predictions of response latencies also follow from the categorization hypothesis. On the assumption that reaction time (RT) generally increases as a function of stimulus complexity (Flores, 1956; Grebb, 1954) and that categorization is presumably a determinant in accuracy of perception (Bruner, Goodnow, & Austin, 1956, p. 9), it would be expected that accurate response latencies should be shorter in 5-6-7 than in 3-6-9.



The above predictions of errors and 
latencies generated from the stimulus 
categorization hypothesis produce, as 
logical consequents, some specific pre- 
dictions of error-latency relations. If 
latencies reflect difficulty of categorization of positive lights, Ss who do not take the required time for accurate categorizing should make more errors than those who take adequate time. Accordingly, if Ss can be induced to reduce response latencies they should make more errors. Since there is some evidence that knowledge of performance may reduce RT (Bilodeau & Bilodeau, 1961, p. 250), giving Ss information about their speed of reaction should reduce latency. Insofar as 5-6-7 latencies are expected to be close to simple RTs, further reductions in RT or increase in errors should be limited. In 3-6-9, however, greater absolute reductions in latencies are possible. Accordingly, greater amount of speed information should be associated with larger error differences and smaller latency differences between the two stimulus conditions, and these should be attributable to the speed information effect in Cond. 3-6-9.


METHOD 
Apparatus.—The apparatus was a modification of that used by Brown et al. (1951). The main component was a 180° semicircular panel on which was mounted a horizontal row of 11 lights separated by intervals of 4° of visual angle, at eye level. The essential modification was the use of 4° intervals of visual angle with the lights at a distance of 30 in. from S’s eyes instead of the 8° intervals at a distance of 60 in. employed by Brown et al. The lights, identified by numbering 1 through 11 from left to right, were standard 6.3-v., .15-amp. miniature bayonet indicator lamps encased in milk-glass covers. A similar orange colored light located directly above the central light, Light 6, served as both fixation point and ready signal. The panel was 3 ft. high, painted flat black, covered over the top with black cloth, and placed in a semidark room to minimize extraneous visual cues. The S responded by pressing a telegraph key placed at his right hand (or left, if left handed). Trial intervals were regulated by an electric motor. The E, seated behind the panel and not visible to S, selected the stimulus light, randomly varied the foreperiod between 2, 3, and 4 sec. on a Hunter electric interval timer, and read the RTs to the nearest .01 sec. from a Hunter electric clock.
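The change in spacing described above can be checked with a little arithmetic. The sketch below assumes the lights lie on a circular arc with S's eyes at its center, so that the linear separation is simply radius times angle in radians; the function name is illustrative and not part of the original report.

```python
import math

def arc_spacing(distance_in, angle_deg):
    """Arc length in inches subtended by angle_deg at a viewing distance of distance_in."""
    return distance_in * math.radians(angle_deg)

# Present apparatus: 4 degrees of visual angle at 30 in.
present = arc_spacing(30, 4)

# Arrangement attributed to Brown et al.: 8 degrees at 60 in.
brown = arc_spacing(60, 8)

# Angular extent from Light 1 to Light 11 (10 intervals of 4 degrees).
span_deg = 10 * 4
```

Under this assumption the present lights are about 2.1 in. apart along the arc, one quarter of the linear separation in the earlier arrangement, and the full row of 11 lights spans 40° of the 180° panel.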

Procedure.—The basic 2 × 3 factorial design involved six sets of Ss under two stimulus conditions and three RT information groups. The groups under Cond. 5-6-7 were instructed to respond to the lighting of either Lights 5, 6, or 7 and those in Cond. 3-6-9 to either 3, 6, or 9. All the groups were instructed not to respond when any of the other lights were lit.

Within Cond. 5-6-7 and 3-6-9, Ss in the total information (TI) group were given information of speed of response after every trial and the partial information (PI) group after every fifth trial. The Ss were told their performance was “good” (short latency), “fair,” or “poor” (long latency) in equal proportion and E attempted to make this correspond with Ss’ actual variation in latencies. For the no information (NI)
group, speed and 
equally and instructions to 
repeated 
told wher 

After 

light 
S was giver 
ited 


ina prede termil 


distrib 


intervals were 


the y light 


ligt 


onset ol 
training trials 

lights might ge 
to the positive lights (5, ¢ 


S was told that nov 


on but he was 


vendir 


lights dis 


positive 


three 
de termined random order 

In addition to Cond 
the basic design, a third stimu 
Cond. 6) was included 


structed to respond ol 


5-6-7 
where 


group was given total RT information 
to the other TI groups rhe 
booster trials given Light iring the 
series wa 

to the three positive lights i 


two stimulus 


num 

test 
oster trials 
1 of the other 
Subjects.—Ss were 130 students, both men and women, from a course in introductory psychology. Within each of Cond. 5-6-7 and 3-6-9, 20 Ss were randomly assigned to each PI and TI group and 15 to each NI speed information group. The remaining 20 Ss were in the Cond. 6 group.
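As a consistency check on the design, the cell sizes can be tallied against the degrees of freedom quoted in the Results. The sketch below assumes an ordinary between-subjects analysis of variance (error df equal to N minus the number of cells); the dictionary layout and variable names are illustrative only.

```python
# Cell sizes for the 2 x 3 factorial part of the design.
cells = {
    ("5-6-7", "NI"): 15, ("5-6-7", "PI"): 20, ("5-6-7", "TI"): 20,
    ("3-6-9", "NI"): 15, ("3-6-9", "PI"): 20, ("3-6-9", "TI"): 20,
}

total_ss = 130
n_factorial = sum(cells.values())      # Ss in the six factorial cells
n_cond6 = total_ss - n_factorial       # Ss left over for Cond. 6

df_stimulus = 2 - 1                    # two stimulus conditions
df_information = 3 - 1                 # three information groups
df_error = n_factorial - len(cells)    # within-cells error term

# Two-group comparison of the Cond. 5-6-7 TI group with the Cond. 6 group.
df_ti_vs_cond6 = cells[("5-6-7", "TI")] + n_cond6 - 2
```

These values agree with the df of 1/104 and 2/104 reported for the factorial analyses and with the df of 1/38 reported for the comparisons involving Cond. 6.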


RESULTS 


Errors.—The mean percentages of errors to each of the negative lights are plotted for the different speed information groups in Fig. 1 for 5-6-7 and Fig. 2 for 3-6-9. The analysis of variance of total errors over all information conditions showed that the stimulus condition effect was significant (F = 38.19, df = 1/104, P < .001) with fewer total errors in 5-6-7 than in 3-6-9. The stimulus effect was also significant for errors to Lights 4 and 8 (F = 14.91, df = 1/104, P < .001), with the fewer errors in 5-6-7. This is in contrast to the nearly identical averages of the summation hypothesis predictions to these two lights of 54% and 52% errors expected for 5-6-7 and 3-6-9, respectively. (Lights 4 and 8 frequencies were combined in the analysis of variance since a chi square test of the difference between errors to these lights was not significant: χ² = .90, df = 2, P > .10.) These results are consistent with the stimulus grouping hypothesis that the spatial contiguity of positive lights facilitated categorization and led to the reduction of errors. Errors are clearly not attributable to a failure to discriminate lights as positive since Ss failed to respond appropriately to positive lights in only three trials. The hypothetical summation gradients in Fig. 1 and 2 were plotted by erecting around each of Lights 5, 6, and 7 and around Lights 3, 6, and 9 the gradient obtained when only Light 6 was positive. The values of the overlapping points, at each negative light, were summated according to Hull’s formula for exponential summation of gradients (Hull, 1943, p. 200). The value 100 was substituted for M in the formula where M is the physiological limit of the learning process. In this experiment, the maximum percentage of responses to the positive lights was 100%.

Fig. 1. Mean percentages of errors in Cond. 5-6-7 to each negative (test) lamp, and percentages predicted by the summation hypothesis for the TI group. (The latter were obtained by erecting around each of Lights 5, 6, and 7 the gradient obtained when only Light 6 was positive, and exponentially summating overlapping points. The wings of the single Light 6 gradient, also shown, are displaced one spatial unit away from Light 6 so that there is correspondence of percentage of errors to each test light, for all gradients, on the dimension of distance from the nearest positive light.)
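The summation of overlapping gradients with a ceiling M can be sketched as follows. This is a minimal illustration, assuming the pairwise rule commonly attributed to Hull (1943), H = H1 + H2 - H1*H2/M, applied successively to the overlapping values; the exact form on the page cited should be checked against the source, and the input percentages below are hypothetical, not data from this study.

```python
from functools import reduce

def hull_sum(values, M=100.0):
    """Combine overlapping gradient values under a ceiling M.

    Pairwise rule (assumed here): a + b - a*b/M, applied left to right.
    This is equivalent to M * (1 - product of (1 - h/M) over all h).
    """
    def combine(a, b):
        return a + b - a * b / M
    return reduce(combine, values, 0.0)

# Hypothetical percentages contributed at one negative light by the
# gradients erected around three positive lights.
predicted = hull_sum([30.0, 20.0, 10.0])
```

With M = 100 the combined value can never exceed 100%, which matches the use of the maximum percentage of responses to positive lights as the limit.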


Fig. 2. Mean percentage of errors in Cond. 3-6-9 to each negative (test) lamp, and percentages predicted by the summation hypothesis for the TI group. (The latter were computed as described for Fig. 1.) [Axes: per cent error responses by stimulus lamp number.]

Fig. 3. Mean number of errors to Lights 4 and 8 in two stimulus conditions and three speed information groups. [Axes: mean error responses by information group: no information (NI), partial information (PI), total information (TI).]


Figure 1 shows that the error 
gradient of the TI group in Cond. 
5-6-7 closely approximated the gra- 
dient obtained in Cond. 6, also under 
TI. The mean difference between 
these sets of points was not significant 
(F=1.57, df=1/38), indicating that 
the number of stimuli to be cate- 
gorized did not affect generalization. 
Moreover, the large discrepancy be- 
tween summation hypothesis predic- 
tions and obtained error frequencies 
for the TI group in Cond. 5-6-7 indi- 
cates that summation does not ac- 
count for the results obtained by this 
VR procedure. The mean difference 
between these two sets of points 
was significant (F = 23.81, df = 1/8, 
P < .01).² However, Fig. 2 suggests that in Cond. 3-6-9 the results obtained for the TI group approximated fairly closely the predictions of the summation hypothesis. The nonsignificant F of 2.39 (df = 1/8) for the difference between these curves was consistent with this observation.

The curves in Fig. 3 plot mean errors to Lights 4 and 8 as a function of speed information for Cond. 5-6-7 and 3-6-9 and show that greater amounts of speed information were associated with higher occurrence of errors (F = 7.39, df = 2/104, P < .01). The information effect was also found to be significant for total errors to all negative lights (F = 12.15, df = 2/104, P < .001). A further t test analysis of errors to Lights 4 and 8 at each information level showed that the stimulus condition effect occurred only where information was given (t = 2.56, df = 38, P < .05, for the difference between PI groups and t = 3.24, df = 38, P < .01, for the TI groups). The corresponding difference between the NI groups was not significant.

² In order to obtain scores for a variance estimate of the hypothesized gradient, necessary to the analyses of differences between distributions, the summation procedure was carried out separately for Cond. 5-6-7 and 3-6-9 by using the error scores, to the relevant lights, made by each S in Cond. 6. A statistical problem posed because several Ss in Cond. 6 had zero error scores could not be resolved by a transformation since the subsequent scores were to be exponentially summated. This was overcome by using the mean scores of Ss randomly combined into groups of 4, at the cost of reduction in df.

Latencies.—All the latency computations were based on the reciprocals of the latency scores (Edwards, 1960, p. 131) and the weight of the statistical analyses was almost wholly on the latency scores to positive lights. The latter was necessary since the latencies for errors were not suited to the usual statistical tests of differences between means since mean latencies for Ss were computed from unequal numbers of scores because individual Ss made different numbers of errors to different lights.
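The reciprocal transformation mentioned above can be illustrated briefly. The sketch assumes that each latency in seconds is converted to 1/latency (a speed score) before averaging; the latencies shown are hypothetical, and the function names are not from the original report.

```python
def mean_reciprocal_latency(latencies_sec):
    """Average of the reciprocals (speed scores) of a list of latencies in seconds."""
    reciprocals = [1.0 / t for t in latencies_sec]
    return sum(reciprocals) / len(reciprocals)

def back_to_seconds(mean_reciprocal):
    """Convert a mean speed score back to a latency (the harmonic mean)."""
    return 1.0 / mean_reciprocal

# Hypothetical latencies for one S, including one slow trial.
latencies = [0.40, 0.50, 1.00]

speed = mean_reciprocal_latency(latencies)
harmonic_latency = back_to_seconds(speed)
arithmetic_latency = sum(latencies) / len(latencies)
```

Averaging reciprocals gives less weight to occasional very long latencies than averaging the raw scores, which is a usual reason for preferring the transformation with skewed RT distributions.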


Figure 4 shows the marked effect of stimulus condition on latencies to positive lights at all levels of RT information. The F of 148.55 (df = 1/104) for the test between mean latencies to Lights 3, 6, and 9 and to Lights 5, 6, and 7 of the test trials was significant (P < .001). Comparisons between Cond. 5-6-7 and 3-6-9 at each level of information yielded t test values significant beyond the .001 level. The t’s were 8.94, 6.41, and 6.49 for comparisons between the pairs of NI, PI, and TI groups, respectively. Figure 4 also shows that the magnitude of the differences between stimulus conditions decreased with the giving of greater amounts of speed information and indicates that the locus of this effect was in Cond. 3-6-9. The apparent differential effect of RT information on latency was confirmed by the significant Stimulus × Information interaction (F = 4.47, df = 2/104, P < .02). Analyses by t tests of the differences between NI vs. PI and NI vs. TI within Cond. 3-6-9 yielded t’s of 3.05 (P < .01) and 3.84 (P < .01), respectively; whereas, none of the t’s between information levels within Cond. 5-6-7 reached the .05 level of significance.

Fig. 4. [Caption title not recovered.] (Mean latencies to positive lights are shown in boxes in the appropriate positions between negative lights. The lowest box above Light 6 shows the combined mean latency of the three information groups to positive lights under Cond. 5-6-7 since the three group means were nearly identical.)

It is seen in Fig. 4 that latency to Light 6 in Cond. 3-6-9 was shorter than the mean latency to Lights 3 and 9 (F = 46.84, df = 1/104, P < .001) and similarly for the corresponding difference between Light 6 and the mean for Lights 5 and 7 in Cond. 5-6-7 (F = 12.99, df = 1/104, P < .001). It is probable that the shorter latency for Light 6 is attributable, in part, to the signal light above it which served as a discriminative cue. However, it is unlikely that this alone could account for the main effects since the absolute mean latency to Light 6 differed among stimulus conditions. It was longest under Cond. 3-6-9, shorter in Cond. 5-6-7, and shortest in Cond. 6, as shown in Fig. 4.

Comparison of the mean of the three mean latencies to positive lights for the TI group in Cond. 5-6-7, with the mean latency to Light 6 for Cond. 6 yielded an F of 18.73 (df = 1/38, P < .001). This finding contradicted the assumption that latencies to the grouped stimuli should approximate those obtained when a single light was positive. Hence, though the error gradients did not differ for these two stimulus conditions, as noted in Fig. 1, the data suggest that categorizing three contiguous stimuli as a class is in fact more difficult than distinguishing a single positive stimulus in a continuum of stimuli.


Figure 5 plots error response latencies to the negative lights for the three RT information groups within each of Cond. 5-6-7 and 3-6-9. The upper part of Fig. 5 shows that in 3-6-9, latencies for PI and TI groups tend to be shorter than those for the NI group. In contrast (lower part of Fig. 5), there were no substantial differences in latency gradients or in latency to individual lights as a function of speed information in Cond. 5-6-7, and error latencies were clearly much shorter than those in Cond. 3-6-9 with practically no overlap. Also, error latencies were consistently much shorter than latencies to adjoining positive lights for all three information groups within Cond. 3-6-9 and the differences were slight but in the same direction for Cond. 5-6-7.

Error-latency interaction.—Comparison of latencies to positive lights in Fig. 4 with errors for Cond. 5-6-7 and 3-6-9 in Fig. 1 and Fig. 2, respectively, shows that although latency differences between the two conditions decreased with greater RT information, differences in error frequency generally increased, largely because of the greater effect of speed information in 3-6-9 than in 5-6-7. Wherever RT information reduced latencies to positive lights, as in 3-6-9, error frequency increased. Where RT information had no substantial effect on such latencies, as in 5-6-7, there was no significant difference in error frequency between the information groups within that stimulus condition.

Further evidence of the interaction 
of errors and latencies was observed 
in that all except 1 of the 15 Ss in the 
NI group of Cond. 3-6-9 obtained 
mean latencies to positive lights of 
.5 sec. or greater, whereas, 19 of the 
40 Ss given RT information (PI and 
TI groups) in Cond. 3-6-9 had mean 
latencies shorter than .5 sec. Since 
the NI group obtained significantly 
fewer errors than the PI and TI groups 
in 3-6-9, the .5-sec. latency may be 
conceived as an optimum time for 
categorization of stimulus lights into 
positive and negative classes. Ac- 
cordingly, Ss responding faster than 
that should make more errors than Ss 
with latencies of .5 sec. or greater. 
When this comparison was made 
within TI and PI groups of 3-6-9, an 
F of 19.85 (df = 1/38, P < .001) was 
obtained. Moreover, comparison of 
error frequency of Ss above and below 
the median latency of .58 sec. within
the NI group in Cond. 3-6-9 also
yielded a significant result (¢=2.26, 
df = 28, P < .05) for further indica- 
tion of the relation of shorter latency 
with greater errors. Similarly for 
Cond. 5-6-7, though neither error 
frequency nor latency had differed 
as a function of RT information, fre- 
quency of errors between Ss above 
and below the median latency of .4
sec., for the three RT information 
groups combined, was significantly 
different (F=4.19, df =1/53, P<.05). 

A third source of evidence for 
error-latency interaction was the nega- 
tive Pearson product-moment r’s be- 
tween frequency and latency of er- 
rors. For Cond. 5-6-7, the r’s for 
each group were NI=—.51, PI=—.41, 
and TI = —.50. Under Cond. 3-6-9, 
they were NI = —.48, PI = —.81, 
and TI = —.71. All except the r 
of —.41 were significant at the .05 
level or better. As expected, errors 
and latencies were most substantially 
correlated when RT information was 
given in Cond. 3-6-9, where stimulus 
categorization was presumably most 
difficult. However, the only signifi- 
cant difference (P < .05) between 
correlations between Cond. 5-6-7 and 
3-6-9 within any information level 
was that between the —.81 and —.41 
for the PI groups. 
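The correlations reported above are ordinary Pearson product-moment coefficients. A minimal sketch of the computation follows; the per-S latency and error values are hypothetical and serve only to show the negative relation (shorter latency with more errors) that the text describes.

```python
import math

def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical scores for five Ss: mean latency (sec.) and number of errors.
latencies = [0.45, 0.50, 0.55, 0.60, 0.70]
errors = [9, 7, 6, 4, 2]

r = pearson_r(latencies, errors)
```

A negative r here means that Ss with shorter latencies tended to make more errors, which is the direction of all six coefficients in the text.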

When percentages of errors in 
Fig. 1 and 2 are compared with error 
latencies in Fig. 5, the direction of 
changes in magnitude from light to 
light are notably parallel. Though 
this is particularly true of Cond. 3-6-9, 
it is also seen in Cond. 5-6-7; where 
decreases in error are greatest, as from 
Lights 4 to 3 and from Lights 8 to 9, 
error latencies show a corresponding 
decrease in five of the six instances. 
While latency gradients beyond these
points are somewhat irregular, variations
are within a narrow range suggesting an
essentially horizontal
distribution consistent with the flat
error frequency gradients. Thus the 
results suggest that error frequency 
and error latency gradients are posi- 
tively related. 


DISCUSSION 


The assumption that stimulus condi- 
tions facilitate the categorization of lights 
as positive or negative was consistently 
supported. It seems, however, that the 
stimulus contiguity hypothesis does not 
entirely account for this facilitation. 
The persistent sharp drop in errors and 
latencies to lights adjoining Light 6 in 
Cond. 3-6-9 and the shorter latency to 
Light 6, compared to other positive 
lights, suggest that the central position 
of this light and (or) the signal light 
located above it served as discriminative 
cues which facilitated categorization 
of the adjoining lights as negative stimuli. 
It is possible that the central location 
of the positive lights and the signal light 
also played a role in Cond. 5-6-7, al- 
though the first negative light is two 
lights removed from the signal light. 
Occurrence of such effects had not been 
expected since Bilodeau et al. (1956), 
using a similar apparatus, found that 
error gradients and frequencies to single 
positive lights located 16° of visual angle 
from center were virtually identical with 
those obtained to a central positive 
light. Though this finding clouds the 
interpretation of results in terms of a 
stimulus pattern effect, the influence of 
the latter is apparent in the fact that 
latencies to Light 6 itself decrease from 
Cond. 3-6-9 to 5-6-7 to Cond. 6, sug- 
gesting an effect of stimulus arrange- 
ment on speed of categorization. 

Specification of stimulus arrangement and stimulus position as determinants of SG appears of special interest because this involves the problem of specifying the role of stimulus units in SG. In their review, Mednick and Freedman (1960) indicate that most SG studies follow an empirical formulation which assumes that SG is a decreasing function of increasing physical difference between test and training stimuli. Hull (1943, p. 198) explicitly recognized the untenability of postulating such arbitrary simple physical stimulus units and proposed the jnd as the stimulus unit for establishing SG gradients. The present
results re-emphasize that equal physical 
distance units from different positive 
lights do not correspond to equal decre- 
ments in SG in the VR paradigm. This 
was shown most explicitly by the comparison
of the error frequencies obtained in
Cond. 5-6-7 with the higher frequencies 
of Cond. 3-6-9, and between the obtained 
curves and the summation hypothesis 
predictions in each of these conditions. 
It was further demonstrated by the fact 
that error frequencies to lights adjoining 
Light 6 in Cond. 3-6-9 were much lower 
than errors to lights surrounding Light 
3 and Light 9, although the difference 
in physical distance was identical. The 
results indicate the advantage of a 
stimulus unit or scale that could take into account the specification of the relation among stimuli. The proposal of an ordinal scale of stimulus values by Mednick and Freedman (1960) is an attempt in this direction. However, their approach would predict sharper gradients over the negative lights in Cond. 5-6-7 than obtained in the present study. Since Lights 4 and 8 are separated from the training stimuli by only one ordinal unit, Mednick and Freedman would expect higher percentages of errors on them, relative to the more peripheral lights, because their formulation does not take into account the effect of contiguity of the training lights.

The overall results of this study show a close relation between error and latency. The negative error-latency relation wherein Ss who took less time before responding to training stimuli generally had greater error frequencies indicates that errors were due largely to insufficient time taken (for accurate categorization) before responding. Another aspect of the close relation between categorization and latency is observable in the positive relation between error frequency and error latency which is also found in most other VR SG studies (Gibson, 1939; Mednick, 1958; Rosenbaum, 1953).

The manifold role of latency in the results of this study makes it reasonable to suggest that latency may serve to specify both stimulus and response anchors of the term “categorization” used here to describe the verbal and other mediated processes which are undoubtedly important determinants of SG in any VR paradigm. Stimulus referents of categorization can be specified by noting that the time taken for responses varied with the arrangement or position of lights, as in the different latencies for Cond. 5-6-7 and 3-6-9. Response referents of categorization can be specified by noting that latency was a major determinant of error frequency under given stimulus arrangement conditions, as in the effect of speed information in Cond. 3-6-9. Thus, difficulty or ease of categorization is measurable in terms of latency required to make a correct response, under specified conditions such as, in this study, relative emphasis on speed induced by providing reaction time information.


SUMMARY 


This study examined the effects of stimulus arrangement and experimentally induced differences in response latency on performance in a spatial stimulus generalization situation involving voluntary responses. In Cond. 5-6-7, Ss were instructed to respond to any one of three centrally and contiguously located lights (Lights 5, 6, and 7) of 11 lights in a row, but to inhibit responses to all other lights. In Cond. 3-6-9, the positive lights were noncontiguous (Lights 3, 6, and 9). The results showed that error frequency and latency were smaller in Cond. 5-6-7 than in Cond. 3-6-9. It was also found that giving Ss information on speed of responses reduced latencies and increased errors in Cond. 3-6-9 but had practically no effect in Cond. 5-6-7. These findings were consistent with predictions of stimulus generalization generated from a stimulus categorization hypothesis. Predictions of error frequency by this hypothesis were compared with those made from Hull’s gradient summation hypothesis of stimulus generalization. The latter failed to predict any of the obtained points on the gradient for Cond. 5-6-7, but did approximate most of the points obtained in Cond. 3-6-9, under conditions where Ss were given maximum speed information. The close and consistent relationship found between error frequency and latency was interpreted as indicating that latency can be used to specify both stimulus and response anchors of the mediational process in spatial VR SG.

76 CHARLES Y. NAKAMURA


STIMULUS GENERALIZATION AS A FUNCTION OF
THE FRAME OF REFERENCE ¹


DAVID R. THOMAS AND CHARLES G. JONES


Kent State University


In a study reported by Philip (1952) it was shown that the location of the original stimulus within a generalization test series modifies the shape of the obtained gradient. Philip required his Ss to rank cards containing varying proportions of green and blue dots along a greenness-blueness scale. The frequency with which a given rank, say Number 3, was subjectively attributed to the different cards constituted a gradient of generalization for that value. A separate gradient was thus generated for each rank employed.

Philip systematically varied the length of the generalization test series. With the shortest series employed (six values) there was a tendency for judgments to accumulate in the center of the scale, the “central tendency effect” (Hollingworth, 1909). This effect was reflected in asymmetrical gradients around stimulus values which were noncentrally placed in the series of generalization test stimuli.

Because of the unusual nature of Philip’s (1952) procedure, the significance of his finding with regard to generalization studied by other methods may also be questioned. The purpose of the present study was to assess the generality of Philip’s finding, using a method for obtaining generalization gradients developed by Kalish (1958). The Kalish procedure is more typical of generalization studies in that Ss are first exposed to a single stimulus value and are subsequently tested for their ability to select the original from a randomly presented series of stimuli. This is in contrast to the Philip procedure in which absolute judgments of predominating color are made without previous exposure to some standard stimulus value. It was reasoned that if the “central tendency effect” were shown to distort measures involving retention as well as absolute judgment, its relevance for studies of generalization would be more convincingly demonstrated.

¹ This research was supported in part by National Institutes of Health Grant M-4203.

METHOD 


The Ss were 50 undergraduate men taken from introductory psychology courses at Kent State University. All Ss had normal color vision, as determined with the Dvorine (1944) color perception test.

Apparatus.—The study employed a Skinner-type key pecking apparatus, modified for use with human Ss. The box was approximately 15 in. long, 11 in. high, and 14 in. wide and was painted flat black. The front wall of the box was made of transparent Plexiglas so that Ss could clearly view the pecking key. The S sat in a chair approximately 24 ft. from the key, which was a circular plastic disc j in. in diameter. Illumination was provided by a Cambridge Thermionic Corporation monochromator, Model B, Series 1066, equipped with an Olympus, Model 201250 6-v., 5-amp. light source. The patch of color on the key was approximately 4 mL. in luminance. The only other light in the room was a 7.5-w. “night light” on E’s side of a black cloth screen separating S from E.

A telegraph key was used to measure S’s responses. The telegraph key was placed on the table next to the Skinner box, within easy reach of S’s right hand. It was wired so that its release would illuminate a signal light on E’s side of the screen, thus signifying a response. The box was equipped with an electrically operated shutter which interrupted the monochromator beam when E threw a switch.

[Figure 1: responses plotted against wave length (mμ), 485 to 565 mμ in 10-mμ steps, with the original stimulus marked.]

Fig. 1. Generalization gradients of the five experimental groups. (The gradient of Group 1 is at the top of the figure with that of Group 2 directly below it, etc. Note that the value of the original stimulus is the same for all groups.)


Procedure.—The Ss were divided unsys- 
tematically into five groups of 10 Ss each. 
Each S received the same instructions. They 
were as follows: 


This is an experiment in color perception. 
At the beginning of the experiment a 
specific color will be presented through the 
small hole in front of you. Try to keep this 
color in mind because you will be asked to 
identify it later. After 1 minute this 
color will be turned off and you will place 
your finger and press down on the tele- 
graph key in front of you. I will give the 
signal “ready” and a few seconds later a 
color will again be presented. You must 
decide whether this is the original color 
shown you at the start of the experiment. 
If it is, lift your hand as rapidly as you can
from the key. If it is not, keep pressing on
the key.

I will say the word “ready” whenever I
am about to present a color and you should
be pressing the key at that time. We are
going to try some practice trials. Now we
are going to run through a series exactly
as we would do if this were the real experiment.


At this point the instructions were interrupted
and a stimulus of 600 mμ was presented
for 1 min. Then S was presented with 610
mμ, 590 mμ, 600 mμ, 620 mμ, and 580 mμ.
Each test stimulus was presented for 5 sec.
with 5 sec. intervening between presenta- 
tions. If S appeared to have understood 
the instructions, they were continued as 
follows: 


Now we are going to begin the experi- 
ment. Remember, try to keep the original 
color in mind and respond as rapidly as you 
can, lifting your finger only when the 
original color appears. Do not be dis- 
turbed, however, if you should respond to 
other colors. 


The Ss in all five groups were presented with the same original stimulus, 525 mμ (a middle-green), for 60 sec. The groups differed only with regard to the series of stimuli employed in testing for generalization. In Group 1 the test stimuli were 485 mμ through 525 mμ, in 10-mμ steps. For Group 2 the range covered was 495 mμ-535 mμ, for Group 3, 505 mμ-545 mμ, for Group 4, 515 mμ-555 mμ, and for Group 5, 525 mμ-565 mμ. For each S the five test stimuli were randomized within a series and 12 different series were presented. The number of responses made to the different test stimuli constituted a generalization gradient.


RESULTS AND DISCUSSION 


In Fig. 1 are presented the generali- 
zation gradients of the five groups 
of Ss. It should be remembered that 
all groups were exposed to the same 
standard stimulus. In spite of this, 
the five gradients differ strikingly 
in a manner consistent with the find- 
ings of Philip (1952). The tendency 
to respond to stimuli closer to the 
center of the test series is so strong 
that the peak of the generalization 
gradient tends to be displaced from
the value of the original stimulus.
Thus, a change of, for example, 10 mμ
from the original stimulus does not
produce a fixed generalization decrement,
but may result in no change in
response strength or even in an increment
depending on the location within
the range of test stimuli. It should
be noted that only with Group 3,
where the test stimuli were symmetrically
distributed around 525 mμ, did
the peak of the gradient fall clearly
at that value.


The difference in gradient shape was shown to be statistically
reliable in the following manner: A simple analysis of variance was
performed to test for differences in the mean number of responses
given to 515 mμ in Groups 1, 2, 3, and 4. The result was an F of 9.06
(df = 3/36, P < .01). A parallel analysis was performed for mean
responses to 535 mμ in Groups 2, 3, 4, and 5. The result was an F of
8.41 (df = 3/36, P < .01).

The purpose of this study was to determine whether the effect of
“central tendency” on the generalization gradient is peculiar to the
absolute judgment situation employed by Philip. It is safe to
conclude that it is not. Indeed, the distortion of the gradient
proved even greater in the present experimental situation than under
the conditions of Philip’s experiment.

These findings may be interpreted with reference to Helson’s (1947)
theory of adaptation level. It may be argued that the series of test
stimuli provides a frame of reference against which the memory trace
of the original stimulus is judged. When test stimuli are presented
which fall asymmetrically around the original stimulus, a change in
the frame of reference may be assumed to result, culminating in a
heightened tendency to respond to stimuli nearer to the center of the
test series, thereby distorting the resulting generalization
gradient.

The significance of the “central tendency effect” for studies of
generalization employing conditioning techniques remains an open
question. We would guess, however, that the background of test
stimuli would exert far less influence in that situation than in the
present one. In this study, S’s experience with the original stimulus
was limited to one 60-sec. exposure. The resulting limited
familiarity with the stimulus makes extensive retroactive
interference by the test series which follows more likely. In a
conditioning situation, however, S has the opportunity through
repeated exposure to become much more familiar with the value of the
CS. Greater familiarity with the stimulus should reduce the effect of
the test situation which is later employed. Some direct evidence on
this issue has been reported. Guttman (1956) discussed some pilot
work with pigeons and an operant conditioning technique in which the
asymmetry of the distribution of test stimuli appeared to have no
effect on the resulting generalization gradient. More recent work
from the Kent State laboratory has tended to corroborate his finding.
Of course the species-related and procedure-related differences
between the pigeon and human studies are so great as to make these
findings merely suggestive. Research is needed to assess the role of
familiarity with the CS in determining the influence of the “central
tendency effect” on generalization.


SUMMARY 


Five groups of 10 Ss each viewed a monochromatic light of 525 mμ (a
middle-green) for 60 sec., and then were presented 12 different
random series of wave lengths under instructions to respond only to
the original color. The number of responses made to the different
test stimuli constituted a gradient of generalization. Group 1 was
tested with the series 485-525 mμ, in 10-mμ steps; Group 2,
495-535 mμ; Group 3, 505-545 mμ; Group 4, 515-555 mμ; and Group 5,
525-565 mμ.

Only Group 3, with a central value of 525 mμ, produced a
generalization gradient with a definite peak at 525 mμ; in all other
cases the peak of responding was displaced toward the center of the
series of test stimuli. The extent of this displacement varied
directly with the degree of asymmetry of the test series around the
value of the original stimulus. These results support the assumption
that the generalization test series serves as a frame of reference
against which the memory trace of the original stimulus is judged.
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The effect of reward magnitude on 
extinction is an unresolved question. Zeaman (1949) and Metzger,
Cotton, and Lewis (1957) have found faster running early in
extinction for animals receiving the larger acquisition reward, but
convergence of the curves as extinction progressed. This is
consistent with the assumption (Hull, 1952; Spence, 1956) that the
incentive motivation factor (K) adjusts rapidly but not immediately
to a change in reward magnitude. On the other hand, Hulse (1958) and
Armus (1959) have found faster running over a number of extinction
trials for animals receiving the smaller reward. This might be
predicted from the depression effect found in some studies of
magnitude change during acquisition (Spence, 1956), since no reward
is a greater reduction from large reward than from small, but the
generality of this effect is open to question. All of the above
findings were with 100% reinforcement in acquisition. Hulse’s study
further complicated the picture by showing an interaction between
magnitude and percentage of reward, with large reward producing
greater resistance to extinction than small if rewards were
intermittent during acquisition.

1 This study was carried out at Northwestern University and was
supported by Grant G8706 from the National Science Foundation.

The present study translates the question of the effect of reward
magnitude on extinction into a special case of extinction:
discrimination reversal. In particular, it tests whether the
interaction between magnitude and percentage of reward found by Hulse
in straight-alley extinction will also appear in reversal of a T maze
habit.

While making this test, it is possible in the same experiment to
study the effect of reward magnitude on acquisition. A number of
prior experiments suggest that larger rewards lead to faster learning
of a discrimination if the noncorrection method is used but not if
the correction method is used, either in a single-choice situation or
in a multiple-unit maze. McKelvey (1956) investigated this apparent
interaction of magnitude and method directly and found no effect of
magnitude on correct choices with either method. However, he
manipulated magnitude by varying the time that rats were permitted to
eat rather than the amount of food available. Since rats learn to
speed up their eating over a period of deprivation, it is possible
that his “large reward” and “small reward” groups may have been
eating nearly equal amounts at the end of acquisition. This
possibility is strengthened by the fact that an initial difference in
speed between the two groups disappeared by the end of acquisition.
The acquisition phase of the present experiment incorporates the same
basic design as McKelvey’s, but with reward magnitude defined by
quantity rather than by eating time.


METHOD 


Subjects —The Ss were 96 female albino 
rats of the Sprague-Dawley strain purchased 
from Holtzman Rat Company, Madison, 
Wisconsin. Eight additional Ss were dis- 
carded because of failure to run, refusal to eat 
the reward pellets, or E’s error. The Ss were 
between 72 and 86 days old at the beginning 
of training. 

Apparatus.—The Ss were trained in an 
enclosed, single-unit T maze. This consisted 
of a start box 10 in. long and 6 in. wide, a 
stem 48 in. long and 4 in. wide, arms 7 in. 
long and 4 in. wide, and goal boxes 13 in. 
long and 6 in. wide. An alcove for food 
opened off each goal box in such a way that 
the reward was not visible from the entrance 
to the goal box. The entire maze was 8} in. 
high, with sides and floor of plywood and lids 
of Plexiglas. The inside was painted flat 
black. A guillotine door separated the start 
box from the stem, and sliding doors sepa- 
rated the stem from the arms. A Standard 
Electric timer was started by S’s weight on a 
treadle just beyond the start box and stopped 
by S’s weight on either of two treadles just 
beyond the guillotine doors. 

Design.—All Ss received 48 acquisition trials followed by 24
reversal trials. The design was a factorial combination of four
dichotomous variables: (a) percentage reinforcement of the correct
side during acquisition (100% vs. 50% random), (b) magnitude of
reward during acquisition (four pellets vs. one), (c) acquisition
training method (correction vs. forced-trial noncorrection), and (d)
magnitude of reward during reversal. Thus there were 8 treatment
cells of 12 Ss in acquisition and 16 cells of 6 Ss in reversal. Half
of the Ss in each cell were initially trained to the right side, half
to the left.
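The cell counts in the design follow directly from crossing the dichotomous variables, and a short enumeration makes the arithmetic checkable. This is a minimal sketch only: the dictionary keys and the helper `cells` are hypothetical labels chosen for illustration, and the reversal reward levels (one vs. four pellets) are taken from the Summary of this report.

```python
from itertools import product

N_SUBJECTS = 96  # number of Ss retained, per the Subjects paragraph

# Hypothetical labels for the three acquisition variables.
acquisition_factors = {
    "percentage": ("100%", "50% random"),
    "acq_reward": ("four pellets", "one pellet"),
    "method": ("correction", "forced-trial noncorrection"),
}
# The fourth variable applies only to the reversal phase.
reversal_factor = {"rev_reward": ("four pellets", "one pellet")}


def cells(factors: dict) -> list[dict]:
    """Enumerate every combination of factor levels as a dict per cell."""
    names = list(factors)
    return [dict(zip(names, levels)) for levels in product(*factors.values())]


acq_cells = cells(acquisition_factors)
rev_cells = cells({**acquisition_factors, **reversal_factor})

if __name__ == "__main__":
    print(len(acq_cells), N_SUBJECTS // len(acq_cells))  # 8 cells of 12 Ss
    print(len(rev_cells), N_SUBJECTS // len(rev_cells))  # 16 cells of 6 Ss
```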

It was desirable to hold both number of
acquisition trials and proportional number
of acquisition reinforcements constant for
both magnitudes and both probabilities of 
reward. For noncorrection Ss, this required 
the use of forced trials. Hence, noncorrection 
Ss received a combination of 16 free and 32 
forced trials in acquisition, so arranged that 
half of their acquisition responses were to 
each side. A given ordinal-numbered trial 
was either free or forced for all noncorrection 
Ss, but each S in a given cell had a different 
sequence of forced right and left turns. 
All trials by the correction method were free. 
Reversal training for all Ss was by the non- 
correction method with all trials free and with 
100% reinforcement. 

The above procedure had the effect that a 
correction S received twice as many rein- 
forcements in acquisition as a noncorrection 
S in the corresponding group. However, 
since method was of interest in connection 
with acquisition rather than with reversal, 
this difference was not crucial. Experiments 
on the role of incorrect responses in discrimi- 
nation learning suggest that this procedure 
should make correction and noncorrection 
groups more nearly comparable than if num- 
ber of reinforcements were held constant and 
number of incorrect responses allowed to vary. 

Procedure.—Seven days before the begin- 
ning of training, Ss were placed on a feeding 
schedule of 10 gm. of ground Purina chow a 
day, which continued throughout the experi- 
ment. On 6 of these 7 preliminary days, each 
S received 3 min. of handling and was allowed 
to eat four of the reward pellets. Water was 
always present in the living cages and the 
carrying cages throughout the experiment. 

In both acquisition and reversal, Ss were 
given six trials a day. Thus there were 8 
days of acquisition and 4 of reversal, succes- 
sive except for 1 nontraining day between 
Days 6 and 7 of acquisition. The reward 
pellets were 45-mg. Noyes pellets. At the 
start of each trial S was placed in the start 
box. When S was oriented toward the door, 
E opened it, permitting S to enter the stem. 
In the noncorrection procedure, after S 
entered either arm the sliding door was 
closed and S was confined for approximately 
15 sec. (If it was a forced trial, the door to 
the other arm was already closed.) In the 
correction procedure, if S entered the incorrect 
arm the door remained open and S was per- 
mitted to retrace, but when S entered the 
correct arm the door was closed and S was 
confined for 15 sec. In both conditions S 
remained in a carrying cage for about 30 sec. 
between trials. The Ss received their daily 
ration about $ hr. after training. 
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RESULTS 


Acquisition.—The number of correct choices was tabulated for each S
in acquisition. Since only the free trials were relevant for the
noncorrection Ss, only the corresponding ordinal-numbered trials were
counted for the correction Ss. Thus each acquisition score
represented the number of correct turns out of a possible 16. Three
cases of failure to run (out of 1536 trials) were counted as errors.

The effects of the three acquisition variables are shown in the three
pairs of curves in Fig. 1, each curve based on 48 rats. A
triple-classification analysis of variance, with df = 1/88 for each F
ratio, indicated that all three main effects were significant.
Acquisition was faster for four pellets than for one (F = 6.71,
P = .05), for 100% reinforcement than for 50% (F = 16.86, P = .001),
and for forced noncorrection method than for correction (F = 7.38,
P = .01). None of the interactions was significant, all four Fs being
less than unity. The Hartley test did not indicate any heterogeneity
of variance (Fmax of 3.64). Although there was a marked negative skew
in the distribution of scores, it is unlikely (Boneau, 1960) that
this had any substantial effect on the Fs.

Reversal.—The number of correct choices on the 24 reversal trials
was tabulated for each S. Table 1 presents the means and SDs of these
scores for the 16 groups. These distributions were more nearly normal
than those for acquisition, and the apparent heterogeneity of
variance was not significant by Bartlett's test (B = 19.76, df = 15).

Table 2 summarizes the quadruple-classification analysis of variance
of these data. Although all four main effects were significant when
tested against the within-cells variance, the three that involved
acquisition variables were overshadowed by interactions. Figure 2
shows the course of reversal learning classified in three different
ways so as to clarify these interactions. The effect of acquisition
variables on reversal may be summarized by saying that 100%
correction Ss reversed fastest and 50% large-reward Ss reversed most
slowly. Reversal, like acquisition, was faster for the larger current
reward.

TABLE 2
ANALYSIS OF VARIANCE OF NUMBER OF CORRECT CHOICES IN REVERSAL
Sources: Percentage reinforcement (%), Training method (M),
Acquisition reward (KA), Reversal reward (KR)
* P < .05.
*** P < .001.

The number of errors prior to the first correct response in reversal
was also tabulated and subjected to an analysis of variance on the
three acquisition variables. This measure gave substantially the same
results as the total correct reversal responses. The main effects of
percentage and method fell short of significance, but the effect of
acquisition magnitude remained significant at the .001 level. The
interactions of percentage with reward size and percentage with
method manifested the same patterns as before and were both
significant at the .01 level.

Speeds.—Time scores were converted to reciprocals and analyses of
variance computed. In a triple-classification analysis for the last
four trials of acquisition, with 1/88 df for each F, the correction
method gave faster running at the .001 level (F = 131.43) and larger
reward gave faster running for correction-method Ss only (F for
magnitude-method interaction = 4.70, P = .05). No other F was
significant. In a quadruple-classification analysis for all 24
reversal trials, with 1/80 df for each F, Ss receiving the larger
current reward ran faster (F = 10.16, P = .01) and those trained with
the correction method ran faster (F = 8.90, P = .01). No other F was
significant.
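
The reciprocal transformation and the error degrees of freedom reported above (1/88 and 1/80) can both be reproduced with a few lines of arithmetic. The sketch below is illustrative only: `speed` and `error_df` are hypothetical helper names, and the within-cells df is computed on the assumption of a fully crossed between-subjects design with one score per S.

```python
from math import prod


def speed(time_sec: float) -> float:
    """Convert a running time in seconds to a speed score (its reciprocal)."""
    if time_sec <= 0:
        raise ValueError("time must be positive")
    return 1.0 / time_sec


def error_df(n_subjects: int, levels_per_factor: list[int]) -> int:
    """Within-cells df for a fully crossed between-subjects design.

    Equals the number of subjects minus the number of treatment cells.
    """
    n_cells = prod(levels_per_factor)
    return n_subjects - n_cells


if __name__ == "__main__":
    # Triple classification: three two-level acquisition variables.
    print(error_df(96, [2, 2, 2]))     # 88
    # Quadruple classification: adds the two-level reversal reward variable.
    print(error_df(96, [2, 2, 2, 2]))  # 80
    print(speed(4.0))                  # 0.25
```

Each main effect and interaction among two-level variables has 1 df in the numerator, which is why every F in these analyses is reported with 1/88 or 1/80 df.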

Fig. 2. Reversal learning for all Ss, classified three different ways
according to reversal variable (top) and acquisition variables
(middle and bottom).
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DISCUSSION 


Acquisition.—The superiority of the forced-trial, noncorrection
method, in spite of the smaller number of reinforcements that it
provided, argues for the importance in learning of making nonrewarded
incorrect responses. Not only did the correction Ss have less
experience with the incorrect side, they also were able to obtain
reward, although delayed, even when they made the wrong choice. The
fact that the noncorrection Ss were forced half the time to the wrong
side makes it difficult to compare these results with those of other
studies comparing the correction and noncorrection methods. However,
in emphasizing the importance of incorrect responses, this study
agrees with others that have found better learning of a
discrimination with a combination of rewarded and nonrewarded trials
than with the same number of trials all rewarded.


No support was found for the hypothesis that greater reward leads to
faster spatial discrimination learning with the noncorrection but not
with the correction method. Larger reward gave faster learning
throughout, and there was no interaction between magnitude and
method. The possibility that the present finding is due to the use of
forcing in the noncorrection method cannot be ruled out. However,
since this experiment was begun, another finding disconfirming the
hypothesis has been reported by Lawson, Cross, and Tambe (1959). Thus
it appears that some other factor must be found to explain the
discrepant prior findings on the relation of discrimination learning
to reward magnitude.

The more rapid acquisition with 100% than with 50% reinforcement is
comparable to the common, though by no means universal, finding with
acquisition of a simple running response.

Reversal.—The finding that a large reward 50% of the time produced
greater resistance to reversal than any of the other three
combinations of magnitude and probability is similar to Hulse’s
(1958) finding on resistance to extinction in a straight alley.
However, the lack of any difference among the other three groups
differs from Hulse’s findings. The effect found here is consistent
both with a partial reinforcement extinction effect, but only for the
large reward condition, and with a greater persistence of more
strongly rewarded responses, but only for the 50% condition, and
hence is not completely consistent with either of these principles.

Training method had no appreciable effect on reversal when total
number of reinforcements was held constant by comparing 50%
correction with 100% noncorrection Ss. The interaction that was found
between method and percentage may reflect in part a tendency to
prefer the less often experienced side (Denny, 1957). Since the 100%
correction Ss made the largest number of correct turns in acquisition
(the noncorrection Ss being forced equally often to the two sides),
this tendency should be greatest for them and should lead to more
rapid extinction of the old response and acquisition of the new
response, as was found. The partial reinforcement effect might then
be invoked to explain why the 50% correction Ss were markedly slower
in reversing than the 100% correction Ss. According to this
combination of factors, however, the 50% noncorrection group should
have reversed most slowly, which was not the case. Thus this
interaction also remains at least partly unexplained.

The faster reversal with larger reversal reward agrees with the
finding of faster acquisition with larger reward. Since the reversal
training method of free, noncorrection trials was different from
either of the acquisition methods, the superiority of the larger
reward in this condition increases the generality of the finding that
larger reward leads to faster discrimination learning. There was no
evidence of generalization decrement resulting from the change in
reward magnitude between acquisition and extinction. The lack of
interaction between reversal reward magnitude and any of the
acquisition variables argues for the independence of acquisition and
reversal variables in their effect on reversal.

Speeds.—Since the recorded running speeds include time in the stem
plus time in the choice area, the speed data are difficult to
interpret. The most striking finding is the superiority of the
correction method. McKelvey (1956) also found greater speed and less
accuracy with the correction method, and in his study this might be
attributed to the greater consistency of reward with correction
method. This explanation seems inadequate in the present study,
however, since 100% reinforcement did not give significantly faster
running than 50%. It is also striking that the superiority of the
correction group continues through reversal, even though all Ss were
on noncorrection method during reversal. Since the noncorrection Ss
received primarily forced trials in acquisition, it seems likely that
forcing to the incorrect side, rather than noncorrection as such, was
the crucial factor. However, why the difference should remain after
all animals were changed to free trials is problematic.

In general the effect of magnitude is greater on choices than on
speeds. In particular, noncorrection Ss made more correct choices in
acquisition, but did not run faster, for large than for small reward.
This argues against Pubols’ (1961) suggestion that the effect of
magnitude on choices is mediated by the effect on speeds.


SUMMARY 


Acquisition and reversal in a T maze were 
studied for 96 rats as a function of four vari- 
ables combined factorially: (a) 100% vs. 50% 
random reinforcement in acquisition, (b)
one vs. four reward pellets in acquisition, 
(c) correction vs. forced-trial noncorrection 
method in acquisition, and (d) one vs. four 
reward pellets in reversal. All Ss received 
100% reinforcement and free-trial noncorrec- 
tion method in reversal. 

Acquisition of the correct response was 
faster for large reward, 100% reinforcement, 
and forced noncorrection method, with no
interactions. Reversal was faster for large
reversal reward, faster after 100% reinforce- 
ment with correction method than any other 
combination of percentage with method, and 
slower after 50% reinforcement with large 
acquisition reward than any other combina- 
tion of percentage with acquisition magnitude. 
Acquisition by the correction method gave 
faster running during both acquisition and 
reversal. Running was faster with larger 
current reward both during acquisition by 
correction method and during reversal. 


REFERENCES 


Armus, H. L. Effect of magnitude of reinforcement on acquisition and
extinction of a running response. J. exp. Psychol., 1959, 58, 61-63.

Boneau, C. A. The effects of violations of assumptions underlying the
t test. Psychol. Bull., 1960, 57, 49-64.

Denny, M. R. Learning through stimulus satiation. J. exp. Psychol.,
1957, 54, 62-64.

Hull, C. L. A behavior system. New Haven: Yale Univer. Press, 1952.

Hulse, S. H., Jr. Amount and percentage of reinforcement and duration
of goal confinement in conditioning and extinction. J. exp. Psychol.,
1958, 56, 48-57.

Lawson, R., Cross, H. A., & Tambe, J. T. Effects of large and small
rewards on maze performance after different prior experiences with
reward amounts. J. comp. physiol. Psychol., 1959, 52, 717-720.

McKelvey, R. K. The relationship between training methods and reward
variables in brightness discrimination learning. J. comp. physiol.
Psychol., 1956, 49, 485-491.

Metzger, R., Cotton, J. W., & Lewis, D. J. Effect of reinforcement
magnitude and of order of presentation of different magnitudes on
runway behavior. J. comp. physiol. Psychol., 1957, 50, 184-188.

Pubols, B. H. The acquisition and reversal of a position habit as a
function of incentive magnitude. J. comp. physiol. Psychol., 1961,
54, 94-97.

Spence, K. W. Behavior theory and conditioning. New Haven: Yale
Univer. Press, 1956.

Zeaman, D. Response latency as a function of the amount of
reinforcement. J. exp. Psychol., 1949, 39, 466-483.


(Received June 30, 1961) 





Journal of Experimental Psychology 
1962, Vol. 64, No. 1, 87-93 


PAIRED-ASSOCIATE LEARNING UNDER SIMULTANEOUS
REPETITION AND NONREPETITION CONDITIONS ¹

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Several recent investigators (e.g., 
Clark, Lansford, & Dallenbach, 1960; Estes, 1960; Estes, Hopkins, &
Crothers, 1960; Rock, 1957; Rock & Heimer, 1959) have obtained
evidence leading them to conclude that associative learning is an
all-or-none rather than a gradual incremental process, and that
repetition serves only to provide additional opportunities for such
all-or-none associations to be learned. However, none of these
studies have been sufficiently free of defects in experimental design
and procedure to preclude alternative interpretations, discussed at
length elsewhere (Postman, 1962; Underwood, 1961; Underwood, Rehula,
& Keppel, 1962), so that the theoretical issue of all-or-none vs.
gradual association formation remains a matter of considerable
dispute.


The present paper reports a series of experiments designed to
eliminate or minimize the influence of several inadequately
controlled variables in the procedure originally introduced by Rock
(1957), wherein performance of a repetition group receiving the same
list of paired-associate items on each trial did not differ from a
modified nonrepetition group, for whom all pairs not correctly
responded to on any trial were removed from the list and replaced by
new pairs on the following trial. Since groups learning under the
repetition and nonrepetition procedures may also differ in such
factors as instructions and approach to the task (Brackett, 1961), or
amount of interference produced by other pairs in the list (Brown,
1961; Clark et al., 1960), a within-Ss design was employed so that
each S learned simultaneously under both conditions and therefore
served as his own control. To more adequately test the role of
repetition in association formation as distinguished from other
processes involved in paired-associate learning (e.g., Underwood &
Schulz, 1960), an attempt was made to develop materials which
provided a relatively pure and substantial case of association
formation. These materials were also highly homogeneous and carefully
calibrated so as to be equivalent in difficulty, in order to minimize
the effects of selective elimination of the more difficult incorrect
pairs on each trial, which has been shown to result in a
significantly easier list under the nonrepetition condition
(Underwood et al., 1962; Williams, 1961).

1 Experiments I and II were reported at the April 1959 meetings of
the Eastern Psychological Association, Atlantic City, N. J.; Exp. III
and IV were supported by a contract with the United States Office of
Education, Department of Health, Education, and Welfare. The author
is indebted to Douglas Nelson for assistance with Exp. IV.


EXPERIMENTS I AND II
Method 


The method and procedure for both Exp. I and II closely followed Rock
(1957), except that the list for each S consisted of two equal-sized
subsets of pairs representing the repetition (R) and nonrepetition
(NR) conditions. All pairs of the R subset were presented on every
trial throughout the experiment, whereas only those pairs responded
to correctly on the preceding trial were retained in the NR subset,
all incorrect pairs being removed from the list and replaced on the
next trial by new pairs. The 15 Ss in each experiment were given
typical paired-associate instructions with an added statement that
new pairs might be introduced during the experiment, although the
procedure effectively prevented S from distinguishing between the R
and NR subsets, or from detecting the basis for introducing new NR
pairs.
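
The trial-to-trial rule for the two subsets can be stated as a small update step, which may help in checking how the R and NR conditions differ. This is a sketch under stated assumptions, not the procedure as actually run: the function name `next_trial_list`, the string labels for pairs, and the use of a queue of unused replacement pairs are all hypothetical, and the even interleaving of the two subsets within the presented list is omitted.

```python
from collections import deque


def next_trial_list(r_pairs, nr_pairs, correct, replacements):
    """Build the R and NR subsets for the next trial.

    r_pairs:      pairs in the repetition subset (always retained).
    nr_pairs:     pairs currently in the nonrepetition subset.
    correct:      set of pairs answered correctly on the trial just ended.
    replacements: deque of unused pairs, consumed from the left.
    """
    new_nr = []
    for pair in nr_pairs:
        if pair in correct:
            new_nr.append(pair)                     # correct NR pair is kept
        else:
            new_nr.append(replacements.popleft())   # incorrect NR pair is replaced
    # R pairs are repeated whether or not they were answered correctly.
    return list(r_pairs), new_nr


if __name__ == "__main__":
    pool = deque(["N3", "N4"])
    r, nr = next_trial_list(["R1", "R2"], ["N1", "N2"], {"R1", "N2"}, pool)
    print(r, nr, list(pool))
```

In the usage example the incorrect R pair "R2" stays in the list, while the incorrect NR pair "N1" is swapped for the next unused pair. The sketch raises `IndexError` if the replacement pool runs out, which a fuller version would need to handle.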

All learning materials were typed on 3 × 5
in. cards and presented manually through a
card-exposure device. Each trial began with 
the successive presentation of all pairs in the 
list for 3 sec. each with a 5-sec. interpair inter- 
val, followed by all stimulus terms alone in a 
different order at a 5-sec. rate while S at- 
tempted to recall the associated response, 
with a 30-sec. intertrial interval. To further 
guard against differentiation of the R and 
NR subsets, both were distributed evenly 
throughout the list on each trial. 

Experiments I and II differed only in the kind of learning material
employed. In Exp. I, 84 pairs of nonsense syllables of 47-53%
association value (Glaze, 1928) were selected so as to be maximally
homogeneous in judged ease of learning according to a previous
scaling procedure (Battig, 1959), and divided into 14 subsets of 6
pairs, matched as to mean and SD of these scale values. The initial
list of 12 pairs for each S consisted of two of these equivalent
6-pair subsets representing the R and NR conditions, replacement
pairs for the latter being selected from another equivalent subset
until exhausted, then from another subset, and so on until the final
trial. Each of the 14 subsets was used equally often as the R and
initial NR subset and at each point in the sequence of replacement
pairs for the various Ss. Learning was carried out to a criterion of
1 errorless trial or a maximum of 10 trials.

Pairs of common 4-letter words as stimuli and 2-digit number
responses were used in Exp. II, wherein each S learned to a criterion
of one errorless trial an 18-pair list consisting of two 9-pair
subsets of R and NR pairs. A total of 84 such word-number pairs were
constructed so as to avoid obvious intrapair associations or
similarities, each being used equally often in the R and initial NR
subset and at various points in the sequence of NR replacements.

Each S in both Exp. I and II was given a recognition test immediately
after the last trial, consisting of the individual presentation of 10
incorrect NR pairs which had been replaced, intermixed randomly with
10 additional pairs from the pool of 84 which S had never seen, with
S indicating in each case whether or not he thought he had seen the
pair previously during the experiment.


F. BATTIG


Results 


Means and SDs for the repetition (R) and nonrepetition (NR)
conditions are presented separately in Table 1 for total errors over
the six trials on which data were available from all 15 Ss of Exp. I,
and for total errors and trials to the criterion of one errorless
trial in Exp. II. Revealed herein is a marked and statistically
significant difference in favor of Cond. R in Exp. I (t = 6.44,
df = 14), whereas the slight superiority of Cond. NR for both
measures in Exp. II fell far short of significance (both t's < 1).
However, the recognition test showed significantly above-chance
recognition of incorrect NR pairs which was actually superior in
Exp. II (79.7% correct, t = 7.76) to Exp. I (70% correct, t = 7.75),
demonstrating that something had been learned about the replaced
incorrect pairs even in Exp. II where their removal did not retard NR
learning.

The results of Exp. I, wherein every 1 of the 15 Ss made
more NR than R errors, may be due to facilitation by
repetition of the substantial response learning required
for the nonsense-syllable materials, and therefore are
inadequate to disprove an all-or-none theory of association
formation as distinguished from paired-associate learning
(e.g., Underwood et al., 1962). Although such response
learning was minimized for the word-number pairs of Exp. II,
so also was the amount of association formation
significantly reduced, resulting in such rapid learning
(63% correct responses on Trial 2) that the performance
measures become quite insensitive to any effects of
repetition. Since these materials also differed
considerably in difficulty, thereby providing an important
advantage for Cond. NR, the results of Exp. II cannot be
interpreted as unequivocally supporting an all-or-none
theory of association formation.

EXPERIMENT III


To overcome the major deficiencies of Exp. I and II, an
attempt was made to develop for use in Exp. III materials
which required a substantial amount of association
formation while relatively free from other nonassociative
processes typically involved in paired-associate learning.
This was accomplished by (a) constructing pairs with
maximally different and unrelated stimulus and response
members; (b) using stimuli of low meaningfulness and
familiarity which were highly discriminable from each
other, since previous evidence indicates that
discriminability represents a major factor in learning of
the stimulus (Battig, Williams, & Williams, 1962), whereas
meaningfulness and familiarity are of lesser importance
(Underwood & Schulz, 1960); (c) using highly meaningful and
familiar responses to minimize response learning.


Method 


Each of 72 nonsense shapes of high dis- 
criminability and low association value was 
paired with a different two-digit number 
between 12 and 98 (excluding double num- 
bers and numbers ending in zero), forming 72 
pairs with minimal shape-number similarity 
according to the judgments of 10 preliminary 
Ss. Previous measures had been obtained of 
(a) association value and discriminability 
of each shape from 122 Ss; (b) association 
value of each number from 28 Ss; (c) rated 
“ease of learning’’ of each shape-number pair 
by 66 Ss; (d) actual learning difficulty in a 
separate study reported elsewhere, which 
describes the materials in more detail (Battig 
& Brackett, 1961). These 72 pairs were 
divided into 12 subsets of 6 pairs each which 
were approximately equivalent with respect 
to each of these four indices, each being used 
with approximately equal frequency as the R 
and initial NR subset and throughout the 
sequence of NR replacement pairs. 

All other aspects of method and procedure were identical to
Exp. I with the following minor exceptions: (a) all
materials were photographed and presented for learning and
attempted recall by means of an automatic slide projector;
(b) a 5-sec. rate (4-sec. exposure, 1-sec. interslide
interval) was used for presentation of pairs for learning
as well as for stimuli alone during attempted recall, with
a 30-sec. interval between trials and between presentation
and recall series within each trial; (c) learning was
continued to a criterion of two successive errorless trials
(although none of the 15 Ss ever made an error on the trial
following the first errorless trial); (d) following the
postexperimental recognition test for incorrect NR pairs,
each S was shown again each of the 12 pairs he had learned
and asked to describe verbally how he had learned them.


TABLE 1


PERFORMANCE MEASURES FOR REPETITION
AND NONREPETITION CONDITIONS OF
Exp. I-III


Repetition
Measure
Mean


Errors | 22.13


Errors | 13.2
Trials 4


12.40
4.20


16.00
5.80


20.07
6.67


III | Errors
Trials


Note.—Errors and trials are to a criterion of one
errorless trial, except for Exp. I which is based on
Trials 1-6 for all Ss.


Results

As shown in Table 1, performance was superior on the R
subset according to both error and trial measures, although
significantly so only for errors (t = 2.83, P < .02) but
not for trials (t = 1.85, .05 < P < .10). Further error
analysis revealed the difference to be primarily due to
significantly more response omissions under Cond. NR
(t = 3.06, P < .01), which accounted for 29% of the NR
errors as compared with 18% for Cond. R. The two conditions
did not differ significantly either in partially correct
responses (t = 1.25), intralist intrusions (t = 1.34), or
extralist intrusions (t = 1.05). Recognition test results
were similar to Exp. I and II, yielding 71.7% correct
identifications (t = 9.12). Further analysis revealed
recognition errors to consist primarily of reports of
incorrect NR pairs as not seen previously (85%) rather than
of the unseen pairs as previously seen (15%).

FIG. 1. Mean correct responses per trial for the modified
repetition and nonrepetition conditions of Exp. IV on
Trials 1-7. (Ordinate: mean correct responses; abscissa:
trial; separate curves for Repetition and Nonrepetition.)

Based on postexperimental questioning, the learned pairs
were categorized on the basis of S's ability to verbalize a
mediated basis for associating the shape and number
together, yielding insignificantly more mediated
associations for NR (24%) than R pairs (20%). No basis for
association could be verbalized for 67% and 63% of the R
and NR pairs, respectively, the remaining 13% in each case
representing a "doubtful" category. These results indicate
that the present materials had been reasonably successful
in providing a relatively pure and substantial case of
association formation, and were sufficiently homogeneous to
minimize if not eliminate the problem of differences in
pair difficulty in favor of Cond. NR.

EXPERIMENT IV 

Although the differences in favor
of R over NR performance in Exp. III
were significant statistically, they
were not impressively large in magni-
tude, due at least in part to the
relatively small number of pairs
learned by each S, and the pre- 
dominance of already learned pairs 
on all but the first few trials as a 
consequence of the rapid learning 
of the shape-number pairs. In order 
to overcome these deficiencies and 
provide a more sensitive test of the 
role of repetition in this task, the 
procedure was modified in Exp. IV 
so that all pairs learned on each trial 
were eliminated from the list and 
replaced by new pairs. 


Method 


As in Exp. III, each S was presented with 
a list of 12 shape-number pairs on each trial, 
divided into two equivalent subsets of 6 
pairs representing repetition (R) and non- 
repetition (NR) conditions. However, all 
NR pairs were presented only for a single 
trial, being replaced after each trial by another 
of the equivalent subsets of 6 pairs which 
S had not seen previously. Pairs of the R 
subset were removed from the list and re- 
placed by new pairs only if they had been 
responded to correctly on the preceding trial, 
so that this subset on any given trial typically 
included a combination of new pairs with 
previously presented incorrect pairs. Trials 
were continued in this manner until the pool 
of 72 pairs was exhausted, which required 
seven to nine trials depending on the number of
correct responses to R pairs for the various Ss. 
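
The replacement rules just described can be made concrete with a small simulation. The following is only an illustrative sketch, not the procedure actually used to prepare the lists: the function name `run_exp_iv`, the fixed probability `p_correct` of a correct response, and the rule of stopping when too few unused pairs remain to fill the next list are all assumptions introduced here, and pairs are drawn individually from a shuffled pool rather than as the matched 6-pair subsets used in the experiment.

```python
import random


def run_exp_iv(pool, subset_size=6, p_correct=0.3, seed=0):
    """Simulate list composition over trials under the Exp. IV rules.

    pool        -- identifiers of the available pairs (72 in the study)
    subset_size -- pairs per subset (6 R + 6 NR on every trial)
    p_correct   -- hypothetical chance that a presented R pair is
                   answered correctly (an assumption, not a finding)
    Returns a list of (r_subset, nr_subset) tuples, one per trial.
    """
    rng = random.Random(seed)
    unused = list(pool)
    rng.shuffle(unused)

    def draw(n):
        # Take the next n pairs that S has not yet seen.
        taken = unused[:n]
        del unused[:n]
        return taken

    r_subset = draw(subset_size)
    nr_subset = draw(subset_size)
    trials = []
    while True:
        trials.append((list(r_subset), list(nr_subset)))
        # R pairs answered correctly are dropped; incorrect ones repeat.
        kept = [p for p in r_subset if rng.random() >= p_correct]
        n_new_r = subset_size - len(kept)
        # Every NR pair is replaced after a single presentation.
        if len(unused) < n_new_r + subset_size:
            break  # pool exhausted: the next list cannot be filled
        r_subset = kept + draw(n_new_r)
        nr_subset = draw(subset_size)
    return trials
```

Calling `run_exp_iv(list(range(72)))` returns one entry per trial. The number of trials depends on how many R pairs are answered correctly, which is the same dependence noted in the text for the actual Ss.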

The only other differences in method and 
procedure from Exp. III consisted of (a) 
appropriate modifications in instructions to 
cover the changed conditions; (b) elimination
of the postexperimental recognition test and 
attempted verbalization of correct associa- 
tions; (c) use of 20 paid Ss representing a 
wider range of previous experience and 
sophistication about paired-associate learning 
experiments than in Exp. I-III, each of which 
had used 15 volunteers from undergraduate 
psychology courses. 


Results 


The mean number of correct re- 
sponses per trial for pairs of the R and 
NR subsets are presented in Fig. 1, 
which indicates a consistent and 
increasing superiority of Cond. R to 
NR over the seven trials on which 
data were available from all Ss. 
Total correct responses for Cond. R (Mean = 17.85,
SD = 4.24) were significantly greater (t = 7.06, df = 19)
than for Cond. NR (Mean = 11.65, SD = 4.96). Except for 2
Ss who performed identically under both conditions, all Ss
made more correct responses on the R subset. The slopes of
the R and NR curves of Fig. 1 were found to differ
significantly by trend analysis of variance (F = 10.35,
df = 1/38), due to a highly significant improvement over
trials for Cond. R (F = 29.92, df = 1/19) whereas the
slight increase in correct responses under Cond. NR fell
far short of significance (F < 1). In agreement with
Exp. III, error analysis revealed significantly more
omissions (t = 2.86, P < .01) for Cond. NR (Mean = 11.10)
than Cond. R (Mean = 7.50), but the excess of partially
correct responses and intrusions in Cond. NR (Mean = 24.35)
over Cond. R (Mean = 21.75) was not significant (t = 1.58,
P > .10).

Although the greater increase in correct responses over
trials under Cond. R undoubtedly reflects the increased
proportion of frequently presented incorrect pairs on later
trials, a more direct and convincing demonstration of the
superiority of performance on repeated pairs comes from a
comparison of proportions of correct responses for R pairs
varying

TABLE 2 


PROPORTIONS OF CORRECT RESPONSES FOR 
REPETITION PAIRS AS A FUNCTION 
OF NUMBER OF PRIOR 
PRESENTATIONS 





Total number 

Correct responses 
Total proportion |. 30} .452) .485| .500 
Mean proportion 96) 581! .539 




in number of prior presentations. Table 2 presents the
total number of pairs presented once, twice, etc. within
the R subset summed over all Ss, along with the total
number and proportion of these presentations which yielded
correct responses, and the mean proportion of correct
responses averaged over Ss.² Revealed herein is a
consistent increase with number of prior presentations in
both total and mean proportion of correct responses, the
rate of increase being slightly less for total proportion
due to the relatively greater contribution of slower
learners to this figure as number of presentations
increases. The proportions of first, second, and third
presentations responded to correctly by each S were
subjected to trend analysis of variance, which showed the
increase with number of presentations to be highly
significant (F = 24.56, df = 1/19).
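
The distinction between the "total" and the "mean" proportion can be illustrated with a short sketch. The function name `pooled_and_mean_proportion` and the two-subject data are hypothetical and chosen only to show why slower learners weigh more heavily on the pooled figure; they are not values from Table 2.

```python
def pooled_and_mean_proportion(counts):
    """Return (total proportion, mean proportion) for one presentation number.

    counts -- one (n_correct, n_presented) tuple per S; Ss with no
              presentations at this level are skipped in the mean.
    """
    total_correct = sum(c for c, n in counts)
    total_presented = sum(n for c, n in counts)
    pooled = total_correct / total_presented      # pairs weighted equally
    per_subject = [c / n for c, n in counts if n > 0]
    mean = sum(per_subject) / len(per_subject)    # Ss weighted equally
    return pooled, mean


# Hypothetical third-presentation data: a fast learner contributes few
# such presentations, a slow learner contributes many.
third_presentation = [(2, 2), (3, 10)]
pooled, mean = pooled_and_mean_proportion(third_presentation)
```

With these invented counts the pooled value is 5/12 while the mean over Ss is .65, so the pooled figure is pulled toward the slower learner, in line with the explanation given in the text for the slower rise of the total proportion.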


DISCUSSION 

The present results demonstrate conclusively the
facilitation of paired-associate learning by repetition of
incorrect pairs under conditions where (a) other variables
which may differ for R and NR conditions have been
eliminated through the use of a within-Ss design; (b)
careful precautions have been taken to minimize the effects
of differences between pairs in difficulty; (c)
nonassociative factors in paired-associate learning, such
as response learning and stimulus discrimination, have been
largely eliminated from the task. Only in Exp. II, which
was clearly inadequate due to the extremely rapid learning
of the word-number pairs and the lack of control for
differences in pair difficulty, were the results not in
direct conflict with an all-or-none theory, and even here
the results of the recognition test showed that something
short of a correct association had been learned about the
eliminated incorrect NR pairs.

² A somewhat more stable estimate of performance on
once-presented pairs would probably be provided by the
inclusion of pairs of the NR subset, all of which were
presented only once. However, performance on NR pairs
(p = .247) was somewhat below that for R pairs presented
for the first time (p = .281), probably due to the
differential distribution of these pairs over trials under
the two conditions. The decision to base the present
comparison solely on R pairs therefore represents, if
anything, an overly conservative estimate of the magnitude
of the increase in proportion of correct responses with
number of previous presentations.

In comparison with previous studies using the Rock (1957)
procedure, Exp. III and IV would appear to be clearly
superior with respect to the elimination of uncontrolled
variables and biases favoring either the R or NR condition.
However, despite the extensive efforts to equate the
shape-number pairs in average difficulty, these pairs
probably were not equally difficult for the individual S.
Furthermore, besides being more difficult, the increasing
superiority of performance on more frequently presented R
pairs in Exp. IV may have obtained in spite of the
relatively large contribution of slow learners to measures
based on these pairs. Since either or both of these sources
of bias would clearly favor NR performance, the
conclusiveness of the present results in demonstrating the
positive role of repetition can only be enhanced thereby.

Only if the position is taken that some process other than
association formation has been facilitated by repetition
can the results of Exp. III and IV be regarded as
insufficient to disprove an all-or-none theory of
association formation. Any such argument, of course,
reduces the issue to empirical meaninglessness unless the
association-formation process can be given precise
operational specification. While verbal definitions of
association formation as distinguished from
paired-associate learning suffer from considerable
ambiguity, there seems to be reasonable agreement that this
refers to the process of "connecting," "hooking-up," or
"associating together" the stimulus and response members of
each pair. Procedurally, efforts to provide the necessary
pure case of association formation have concentrated on the
elimination of response learning, reflecting the influence
of Underwood's conception of paired-associate learning as a
two-stage process consisting of response-learning and
associative phases (Underwood et al., 1962; Underwood,
Runquist, & Schulz, 1959; Underwood & Schulz, 1960). There
can be little question but that the present results are
directly applicable to this definition of association
formation as paired-associate learning in the absence of
response learning.

However, although the present stimu- 
lus materials were designed to be maxi- 
mally discriminable, thereby eliminating 
another nonassociative process suggested 
to be involved in paired-associate learn- 
ing (Battig et al., 1962; Gibson, 1940), 
they differed from the letters or words 
typically employed as stimuli in related 
studies in their low meaningfulness and 
familiarity. Inasmuch as stimulus fa- 
miliarity has little or no effect on paired- 
associate learning while stimulus mean- 
ingfulness affects primarily the associa- 
tion-formation phase (Underwood & 
Schulz, 1960), the present shape-number 
materials would therefore appear to be 
maximally appropriate and sensitive as 
a test of the role of repetition in associa-
tion formation, since they do not sub-
stantially reduce, nor shortcut through
already existing mediated associations,
the amount of association formation
required of S.

Nevertheless, although the present materials have
eliminated those nonassociative processes previously
identified to be important in paired-associate learning,
the possibility remains that other processes besides
association formation are still involved in learning the
shape-number pairs. However, unless or until such processes
are identified and given adequate operational
specification, it can be concluded that association
formation as presently defined is not an all-or-none
process, but instead builds up gradually in strength
through repetition. Moreover, in view of the rapid learning
observed even under the present conditions, which were
carefully designed to maximize the required amount of
association formation, it would appear that this process
may constitute such a small and insignificant part of the
learning involved in the typical paired-associate task,
that questions concerning its all-or-none or gradual nature
are likely to be of little consequence for the general
understanding of factors important in paired-associate
learning.

SUMMARY 


The effect of repetition of previously incorrect pairs in
paired-associate learning was evaluated in four separate
experiments in which each S learned a single
paired-associate list consisting of two equivalent subsets
of pairs. In Exp. I-III, each of 15 Ss learned under
conditions where the repetition (R) subset consisted of the
same pairs repeated on all trials, while pairs of the
nonrepetition (NR) subset were retained in the list only if
responded to correctly and were otherwise removed from the
list and replaced by new pairs on the next trial. Lists of
12 nonsense-syllable pairs, 18 word-number pairs, and 12
pairs of nonsense shapes and numbers were used in
Exp. I-III, respectively. Experiment IV (N = 20) used the
same shape-number pairs as Exp. III, but under conditions
where all pairs responded to correctly were immediately
removed from the list, so that the NR subset consisted of a
new set of pairs on each trial, while the R subset included
both new pairs and previously presented incorrect pairs.
Significant differences in favor of Cond. R were obtained
in all cases except for Exp. II, which was attributed to
the greater ease of learning and rather wide range of
difficulty of the word-number pairs. Particularly in view
of the significant positive effects of repetition in
Exp. III and IV, using materials requiring a relatively
pure and substantial case of association formation which
were also carefully calibrated to minimize the effects of
differences in pair difficulty, it was concluded that the
present results are directly contradictory to an
all-or-none theory of association formation in
paired-associate learning.
SUPPLEMENTARY REPORT: EFFECTS OF STIMULUS ASSOCIATION VALUE
AND EXPOSURE DURATION ON R-S LEARNING¹


NED CASSEM AND DONALD H. KAUSLER


St. Louis University 


Jantz and Underwood (1958) demonstrated 
a gross positive relationship between stimulus 
association value (AV) and the amount of 
R-S learning. However, they found the same 
reversal between 0% and 33% Glaze values 
that had been reported earlier by Postman, 
Adams, and Phillips (1955). As noted by 
Jantz and Underwood, the reversal most 
likely represented a sampling artifact in that 
only two syllables were included in their list 
at each AV level. The present study at- 
tempted to provide a more precise determi- 
nation of the relationship between the AV of 
stimuli and R-S learning by means of a more 
adequate sampling of stimulus items. 

In addition, the present study investigated the effects of
exposure duration during both S-R learning and R-S recall
on the amount of R-S recall. If, as proposed by Feldman and
Underwood (1957), R-S learning is a variant of incidental
learning, in which S learns associations for which he has
no learning set during a period in which he is set to learn
other associations, then the amount of R-S or incidental
learning should increase with increased exposure duration
(Kausler & Trapp, 1961; Rosenberg, 1959).

Method.—Eighty seminarians were randomly assigned to the 16
groups of a 2 × 2 × 4 factorial design, representing
exposure duration during S-R learning (TL), exposure
duration during R-S recall (TR), and AV. Each S had six
trials on an eight-item list with nonsense syllables as
stimuli and the same words employed by Jantz and Underwood
as responses. Four sets of eight syllables were selected
from the 0, 33, 67, and 100% Glaze lists, with the
additional criterion that the values approximate Archer's
(1960) revaluation. Within each set similarity was
minimized and four different combinations of S-R pairings
were used to control for associative effects. Both TL and
TR were either 2 sec. or 4 sec. In other respects the
experiment replicated that of Jantz and Underwood (1958).
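
The layout of this design can be checked with a few lines of arithmetic. The sketch below is an illustration only: the helper name `factorial_layout` is hypothetical, and it assumes equal group sizes and a completely between-Ss analysis, neither of which the report states explicitly.

```python
from math import prod


def factorial_layout(levels, n_subjects):
    """Summarize a between-Ss factorial design with equal group sizes.

    levels     -- number of levels of each factor, e.g. (2, 2, 4)
    n_subjects -- total number of Ss
    Returns (n_groups, n_per_group, error_df, main_effect_dfs).
    """
    n_groups = prod(levels)
    n_per_group, remainder = divmod(n_subjects, n_groups)
    if remainder:
        raise ValueError("Ss cannot be divided equally among groups")
    error_df = n_subjects - n_groups           # within-groups df
    main_effect_dfs = [k - 1 for k in levels]  # one df fewer than levels
    return n_groups, n_per_group, error_df, main_effect_dfs


layout = factorial_layout((2, 2, 4), 80)
```

Under these assumptions the 80 Ss fall into 16 groups of 5, with 1 df for each exposure-duration factor, 3 df for AV, and 64 df for error, which agrees with the df values of 1/64 and 3/64 given in the Results.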

Results.—As found in the earlier study, total syllables
correct during the recall trial displayed significant
heterogeneity of variance between groups. The total letters
correct, a criterion used by Feldman and Underwood (1957),
indicated homogeneity of variance. Consequently, the latter
measure served in the analysis of variance of treatment
effects reported here. However, comparable results were
obtained throughout for the heterogeneous total syllable
scores.

The overall means and SDs for AV were 10.30 and 4.77, 14.70
and 5.29, 16.60 and 4.16, and 19.40 and 3.82 for 0, 33, 67,
and 100%, respectively. A trend analysis for these means
revealed a highly significant linear trend (F = 22.34,
P < .001). The overall means and SDs for TL were 12.52 and
5.17 for 2 sec. and 18.20 and 3.92 for 4 sec. The
corresponding TR statistics were 13.90 and 4.54 for 2 sec.
and 16.83 and 4.55 for 4 sec. The main effects for AV, TL,
and TR were all significant (F = 10.85, df = 3/64,
P < .001; F = 26.98, df = 1/64, P < .001; F = 7.14,
df = 1/64, P < .01). Of the interactions, only the TL × TR
approached significance (F = 2.85, df = 1/64, P < .10).
This trend suggested that TR had little effect on the
2-sec. TL groups but did on the 4-sec. TL groups. That is,
the 4-sec. recall condition was most effective in
combination with the 4-sec. S-R learning condition.

The present results appear to verify the assumption made by
Jantz and Underwood (1958) that their reversal for 0% and
33% represented a sampling artifact. The results also
indicate that R-S learning is sensitive to exposure
duration during both the learning and recall periods.

¹ This study is based on a thesis submitted by the first
author to the Graduate School, St. Louis University, in
partial fulfillment for the Master's degree. Portions of
this paper were presented at the 1961 meeting of the
Midwestern Psychological Association.
SUPPLEMENTARY REPORT: EFFECTS OF INSTRUCTIONS ON EXTINCTION
AND RECOVERY OF A CONDITIONED AVOIDANCE RESPONSE


K. E. MOYER

Carnegie Institute of Technology

AND

RICHARD H. LINDLEY¹

Trinity University, Texas


¹ We would like to thank James H. Korn for collecting the
data for this study.


Lindley and Moyer (1961) found that, in the case of a
classically conditioned finger withdrawal response,
informing Ss just prior to extinction training that the UCS
would no longer be delivered produced rapid extinction. The
present study replicated that study, but with a conditioned
avoidance procedure, and in addition tested an implication
of the drive level (D) interpretation of their result;
viz., that the appropriate instructions could raise D level
and lead to an increase in conditioned responding.

Method.—The same experimental room, apparatus, intertrial
stimulus intervals, method of adjusting shock level, and
pretests with the tone alone were used (Lindley & Moyer,
1961). The CS was a .5-sec. tone which preceded a 1.5-sec.
electric shock without overlap. The finger could only move
up when the CS and UCS were presented. No shock was
delivered if S responded to the CS and kept his finger
raised } in. or more during the UCS interval. If S did not
avoid the shock, moving the finger up } in. escaped the
shock.

The booth in which S sat was dark. The initial instructions
were similar to the previous instructions except that the
avoidance rather than the classical CR procedure was
explained. The S was told that we wanted to condition
automatic movements of the finger but that he should not
try to prevent his finger from moving.

A factorial design involving number of acquisition
conditioning trials (to four CRs in 5 consecutive trials,
or that plus 20 trials), instructions prior to extinction
(Neutral or Inhibitory), and instructions after 20
extinction trials (No Instructions or Resume UCS) was used.
The Neutral instructions prior to extinction were: "Be sure
to let your finger jump up to the tone when it feels like
it"; the Inhibitory instructions were: "There will be no
more shock presented from now on. I want you to try to
prevent your finger from moving when the tone is
presented." After 20 extinction trials half the Ss were
given an additional 5 extinction trials without any
instructions (No Instructions); the remaining Ss were told:
"From now on the tone will be followed by shock on most of
the trials. Remember to let your finger jump up to the tone
if it feels like it" (Resume UCS). The extinction procedure
was continued for 5 trials (i.e., no shock was delivered).
The normal intertrial interval was used between Trials 20
and 21 for the No Instructions groups; this interval was
longer for the Resume UCS groups owing to the reading of
the instructions.

The 56 Ss (both men and women) from Carnegie Institute of
Technology who reached criterion within 35 trials were
assigned to the eight groups (N per group = 7) by a
randomized block procedure. Twenty-seven Ss were rejected
owing to failure to reach criterion within 35 trials,
apparatus failure, or unwillingness to be shocked.

Results.—The mean number of trials to reach the criterion
of four CRs in five consecutive trials was 10.98,
SD = 7.16. There were no significant differences among the
groups in reaching this criterion (F = 0.30).

In extinction, a CR was defined as a response that occurred
during the duration of the CS. The variances of the number
of CRs in extinction were highly heterogeneous (χ² = 62.53,
df = 3, P < .001). Owing to the heterogeneity of variance,
the data of the groups that received neutral instructions
and of the groups that received inhibitory instructions
were analyzed separately and the χ² median test was used at
times instead of the more usual t or F tests.

A χ² median test showed that the inhibitory instructions
significantly reduced the number of CRs elicited in the 20
extinction trials as compared to the neutral instructions
(χ² = 23.17, df = 1, P < .001). Figure 1 shows the
extinction and recovery data for all groups and also the
percentage of CRs on the first trial after Ss reached the
criterion in the two groups with the criterion plus 20
trials. The groups with the additional 20 conditioning
trials gave more responses in extinction than the groups
that merely met the criterion (for the two groups with
Neutral instructions, t = 1.27, df = 26, P > .05; for the
groups with Inhibitory instructions, t = 2.18, df = 26,
P < .05).

The instructions designed to raise D level after 20
extinction trials had a significant effect on the groups
that received Inhibitory instructions in extinction (median
test χ² = 8.58, df = 1, P < .01). The same instructions had
no demonstrable effect on the groups that received Neutral
instructions in extinction (χ² = 1.29, df = 1, P > .05).

As in classical conditioning, inhibitory 
instructions dramatically reduced the number 
of CRs during extinction after avoidance 
conditioning, especially after the larger 
number of conditioning trials. This fits in 
with the notion that a habit factor is built 
up through reinforcement and is independent 
of a performance or drive factor which can be 
manipulated by instructions. The results 
also indicate that, for Ss who have been given 
inhibitory instructions, the appropriate in- 
structions can increase the number of CRs 
after extinction, presumably by raising D 
level. The latter result was not found for Ss 
who have been given neutral instructions in 
extinction, perhaps due to the already high 
level of responding of these groups after the 
20 extinction trials. 